MDK1 polypeptides

ABSTRACT

The present invention relates to MDK1 polypeptides, nucleic acids encoding such polypeptides, cells, tissues and animals containing such nucleic acids, antibodies to such polypeptides, assays utilizing such polypeptides, and methods relating to all of the foregoing. 
     Methods for treatment, diagnosis, and screening are provided for diseases or conditions characterized by an abnormality in a signal transduction disorder. The signal transduction pathway involves an interaction between a MDK1 receptor tyrosine kinase and a receptor for the kinase. The MDK1 receptor tyrosine kinase may be truncated and lack a kinase domain and may be selected from the group consisting of MDK1.T1, MDK1.T2, MDK1.Δ1 and MDK1.Δ2.

This is a continuation of application Ser. No. 08/368,776, filed Jan. 3,1995.

FIELD OF THE INVENTION

The present invention relates generally to the field of cellular signaltransduction and more specifically to the diagnosis and treatment ofvarious diseases and conditions associated with abnormal cellular signaltransduction pathways.

BACKGROUND OF THE INVENTION

The present invention concerns methods for diagnosis and treatment ofdisorders characterized by abnormal cellular signal transduction. Thefollowing is a discussion of relevant art, none of which is admitted tobe prior art to the invention.

Cellular signal transduction is a fundamental mechanism whereby externalstimuli that regulate diverse cellular processes are relayed to theinterior of cells. One of the key biochemical mechanisms of signaltransduction involves the reversible phosphorylation of tyrosineresidues on proteins. The phosphorylation state of a protein is modifiedthrough the reciprocal actions of tyrosine kinases (TKs) and tyrosinephosphatases (TPs).

Receptor tyrosine kinases (RTKs) belong to a family of transmembraneproteins and have been implicated in cellular signaling pathways. Thepredominant biological activity of some RTKs is the stimulation of cellgrowth and proliferation, while other RTKs are involved in arrestinggrowth and promoting differentiation. In some instances, a singletyrosine kinase can inhibit, or stimulate, cell proliferation dependingon the cellular environment in which it is expressed.

RTKs are composed of at least three domains: an extracellular ligandbinding domain, a transmembrane domain and a cytoplasmic catalyticdomain that can phosphorylate tyrosine residues. Ligand binding tomembrane-bound receptors induces the formation of receptor dimers andallosteric changes that activate the intracellular kinase domains andresult in the self-phosphorylation (autophosphorylation and/ortransphosphorylation) of the receptor on tyrosine residues. Theirintrinsic tyrosine kinase is activated upon ligand binding, therebyinitiating a complex signal transduction pathway that begins withreceptor autophosphorylation and culminates in the tyrosinephosphorylation of a variety of cellular substrates and ultimately inthe initiation of nuclear events necessary for the overall cellresponse. Individual phosphotyrosine residues of the cytoplasmic domainsof receptors may serve as specific binding sites that interact with ahost of cytoplasmic signaling molecules, thereby activating varioussignal transduction pathways.

The intracellular, cytoplasmic, non-receptor protein tyrosine kinases donot contain a hydrophobic transmembrane domain or an extracellulardomain and share non-catalytic domains in addition to sharing theircatalytic kinase domains. Such non-catalytic domains include the SH2domains (SRC homology domain 2) and SH3 domains (SRC homology domain 3).The non-catalytic domains are thought to be important in the regulationof protein-protein interactions during signal transduction.

A central feature of signal transduction (for reviews, see Posada andCooper, Mol. Biol. Cell 3:583-392, 1992; Hardie, Symp. Soc. Exp. Biol.44:241-255, 1990), is the reversible phosphorylation of certainproteins. Receptor phosphorylation stimulates a physical association ofthe activated receptor with target molecules. Some of the targetmolecules such as phospholipase Cγ are in turn phosphorylated andactivated. Such phosphorylation transmits a signal to the cytoplasm.Other target molecules are not phosphorylated, but assist in signaltransmission by acting as adapter molecules for secondary signaltransducer proteins. For example, receptor phosphorylation and thesubsequent allosteric changes in the receptor recruit the Grb-2/SOScomplex to the catalytic domain of the receptor where its proximity tothe membrane allows it to activate ras.

The secondary signal transducer molecules generated by activatedreceptors result in a signal cascade that regulates cell functions suchas cell division or differentiation. Reviews describing intracellularsignal transduction include Aaronson, Science, 254:1146-1153, 1991;Schlessinger, Trends Biochem. Sci., 13:443-447, 1988; and Ullrich andSchlessinger, Cell, 61:203-212, 1990.

RTKs are important regulators of developmental processes, as reflectedby the high level of tyrosine phosphorylation in the early mouse embryo,which decreases with progressing development and is low in adult animaltissues (Pasquale and Singer, Proc. Natl. Acad. Sci. USA 88:5449-5453,1989). For example, the mouse c-kit proto-oncogene plays a key role inthe migrational behavior of specific cell types in mouse development(Chabot et al., Nature 335:88-89, 1988; Geissler et al., Cell55:185-192, 1988; Nocka et al., Genes Dev. 3:816-826, 1989).

Disruption of the platelet-derived growth factor receptor α (PDGF-Rα)gene is responsible for the mouse patch mutation, which is characterizedby prominent anatomical abnormalities in homozygotes (Stephenson et al.,Proc. Natl. Acad. Sci. USA 88:6-10, 1991). Moreover, Flk-1, the cognatereceptor for the vascular endothelial growth factor (VEGF), was shown tobe a major regulator of vasculogenesis and angiogenesis (Millauer etal., Cell 72:835-846, 1993). Finally, in Drosophila, the RTK sevenlesshas a well established function in the control of photoreceptor cellfate (Basler and Hafen, Science 243:931-934, 1989), as does the RTKtorso in the formation of terminal structures of Drosophila larva(Sprenger et al., Nature 338:478-483, 1989).

Among adult tissues, the brain contains the highest level of proteinkinase activity, comparable to that found in embryonic tissues (Maher,P. A., J. Cell. Biol. 112:955-963, 1991). Members of the trk family ofRTKs have well documented roles in promoting the differentiation andsurvival of diverse groups of neurons of the central and peripheralnervous systems (reviewed in Raffioni et al., Annu. Rev. Biochem.62:823-850, 1993). The eck/eph RTK subfamily (Hirai et al., Science238:1717-1720, 1987) currently comprises the largest subgroup of RTKs(Sajjadi and Pasquale, Oncogene 8:1807-1813, 1993), with most membersbeing expressed in the developing or adult brain.

While RTKs such as eck (Lindberg and Hunter, Mol. Cell. Biol.10:6316-6324, 1990), Hek2 (Böhme et al., Oncogene 8:2857-2862, 1993),Cek6, Cek9, and Cek10 (Sajjadi and Pasquale, Oncogene 8:1807-1813, 1993)have been reported to be widely expressed in a variety of tissues, Elkand Cek5 transcripts have been found predominantly in the brain (Letwinet al., Oncogene 3:621-627, 1988; Pasquale et al., J. Neuroscience12:3956-3967, 1992).

As first noted by Maisonpierre et al. (Maisonpierre et al., Oncogene8:3277-3288, 1993), there is a subclass of RTKs within the eck/ephfamily which, while being strongly expressed in the brain, are alsofound in other tissues, especially during embryogenesis. This subfamilyincludes Ehk-1, Ehk-2, (Maisonpierre et al., Oncogene 8:3277-3288,1993), Mek4, Cek4, Hek (Sajjadi et al., New Biol. 3:769-778, 1991; Wickset al., Proc. Natl. Acad. Sci. USA 89:1611-1615, 1992), eek (Chan andWatt, Oncogene 6:1057-1061, 1991), Sek (Nieto et al., Development116:1137-1150, 1992; Gilardi-Hebenstreit et al., Oncogene 7:2499-2506,1992), Cek7 and Cek8 (Sajjadi and Pasquale, Oncogene 8:1807-1813, 1993),whose members are more related to each other than to either of theabove-mentioned kinases.

SUMMARY OF THE INVENTION

The present invention relates to MDK1 polypeptides, nucleic acidsencoding such polypeptides, cells, tissues and animals containing suchnucleic acids, antibodies to such polypeptides, assays utilizing suchpolypeptides, and methods relating to all of the foregoing. Inparticular, this invention relates to methods for diagnosis andtreatment of a disorder, most preferably a disorder characterized by anabnormality in a signal transduction pathway, wherein the signaltransduction pathway involves the interaction between a MDK1 receptortyrosine kinase and a MDK1 binding partner.

The present invention is based upon the isolation and characterizationof a new member of the sub-group of the eck/eph family of RTKs referredto above, which we have designated mouse developmental kinase 1 (MDK1).MDK1, which was found using a polymerase chain reaction (PCR) basedapproach, exhibits complex transcriptional regulation and is expressedin at least five different forms. Along with two variants containingamino acid deletions in the membrane-proximal extracellular domain andthe juxtamembrane region, we also identified two truncated versions ofMDK1 which lack the catalytic kinase domain. Although MDK1 istranscribed in a variety of tissues in early stages of development, itis found exclusively in the brain, spleen, and testes of adult mice. Theneuronal expression sites characterized indicate an important role forMDK1 in the development of the nervous system.

In addition, we have determined that disruption or promotion of theinteraction between a MDK1 receptor tyrosine kinase and MDK1 bindingpartner is useful in therapeutic procedures. Thus, we have determinedthat a kinase, termed MDK1, is involved in a protein-protein interactionof therapeutic importance. This interaction is associated with the basicsignalling function of proteins associated with various diseases orconditions. MDK1 polypeptides are involved in various signaltransduction pathways and thus the present invention provides severalagents and methods useful for diagnosing, treating, and preventingvarious diseases or conditions associated with abnormalities in thesepathways.

Thus, in a first aspect the invention features an isolated, enriched, orpurified nucleic acid encoding a MDK1 polypeptide.

By “isolated” in reference to nucleic acid is meant a polymer of 2(preferably 21, more preferably 39, most preferably 75) or morenucleotides conjugated to each other, including DNA or RNA that isisolated from a natural source or that is synthesized. The isolatednucleic acid of the present invention is unique in the sense that it isnot found in a pure or separated state in nature. Use of the term“isolated” indicates that a naturally occurring sequence has beenremoved from its normal cellular environment. Thus, the sequence may bein a cell-free solution or placed in a different cellular environment.The term does not imply that the sequence is the only nucleotide chainpresent, but that it is essentially free (about 90-95% pure at least) ofnon-nucleotide material naturally associated with it and thus is meantto distinguish from isolated chromosomes.

By the use of the term “enriched” in reference to nucleic acid is meantthat the specific DNA or RNA sequence constitutes a significantly higherfraction (2-5 fold) of the total DNA or RNA present in the cells orsolution of interest than in normal or diseased cells or in the cellsfrom which the sequence was taken. This could be caused by a person bypreferential reduction in the amount of other DNA or RNA present, or bya preferential increase in the amount of the specific DNA or RNAsequence, or by a combination of the two. However, it should be notedthat enriched does not imply that there are no other DNA or RNAsequences present, just that the relative amount of the sequence ofinterest has been significantly increased. The term significant here isused to indicate that the level of increase is useful to the personmaking such an increase, and generally means an increase relative toother nucleic acids of about at least 2 fold, more preferably at least 5to 10 fold or even more. The term also does not imply that there is noDNA or RNA from other sources. The other source DNA may, for example,comprise DNA from a yeast or bacterial genome, or a cloning vector suchas pUC19. This term distinguishes from naturally occuring events, suchas viral infection, or tumor type growths, in which the level of onemRNA may be naturally increased relative to other species of mRNA. Thatis, the term is meant to cover only those situations in which a personhas intervened to elevate the proportion of the desired nucleic acid.

It is also advantageous for some purposes that a nucleotide sequence bein purified form. The term “purified” in reference to nucleic acid doesnot require absolute purity (such as a homogeneous preparation);instead, it represents an indication that the sequence is relativelypurer than in the natural environment (compared to the natural levelthis level should be at least 2-5 fold greater, e.g., in terms ofmg/ml). Individual clones isolated from a cDNA library may be purifiedto electrophoretic homogeneity. The claimed DNA molecules obtained fromthese clones could be obtained directly from total DNA or from totalRNA. The cDNA clones are not naturally occurring, but rather arepreferably obtained via manipulation of a partially purified naturallyoccurring substance (messenger RNA). The construction of a cDNA libraryfrom mRNA involves the creation of a synthetic substance (cDNA) and pureindividual cDNA clones can be isolated from the synthetic library byclonal selection of the cells carrying the cDNA library. Thus, theprocess which includes the construction of a cDNA library from mRNA andisolation of distinct cDNA clones yields an approximately 10⁶-foldpurification of the native message. Thus, purification of at least oneorder of magnitude, preferably two or three orders, and more preferablyfour or five orders of magnitude is expressly contemplated.

By “a MDK1 polypeptide” is meant 2 (preferably 7, more preferably 13,most preferably 25) or more contiguous amino acids set forth in the fulllength amino acid sequence of SEQ ID NO:2, or a functional derivativethereof as described herein. The MDK1 polypeptide can be encoded by afull-length nucleic acid sequence or any portion of the full-lengthnucleic acid sequence, so long as a functional activity of thepolypeptide is retained. Examples of partial amino acid sequences areshown in SEQ ID NOS 3 and 5.

In preferred embodiments the isolated nucleic acid comprises, consistsessentially of, or consists of a nucleic acid sequence set forth in thefull length nucleic acid sequence SEQ ID NO:1, a functional derivativethereof, or at least 27, 30, 35, 40 or 50 contiguous nucleotidesthereof; the MDK1 polypeptide comprises, consists essentially of, orconsists of at least 9, 10, 15, 20, or 30 contiguous amino acids of aMDK1 polypeptide. The nucleic acid may be isolated from a natural sourceby cDNA cloning or subtractive hybridization; the natural source may beblood, semen, and tissue of various organisims including eukaryotes,mammals, birds, fish, plants, gorillas, rhesus monkeys, chimpanzees andhumans; and the nucleic acid may be synthesized by the triester methodor by using an automated DNA synthesizer. In yet other preferredembodiments the nucleic acid is a conserved or unique region, forexample those useful for the design of hybridization probes tofacilitate identification and cloning of additional polypeptides, thedesign of PCR probes to facilitate cloning of additional polypeptides,and obtaining antibodies to polypeptide regions. Examples of partialnucleic acid sequences are shown in SEQ ID NOS 4 and 6.

By “conserved nucleic acid regions”, are meant regions present on two ormore nucleic acids encoding a MDK1 polypeptide, to which a particularnucleic acid sequence can hybridize to under lower stringencyconditions. Examples of lower stringency conditions suitable forscreening for nucleic acid encoding MDK1 polypeptides are provided inAbe, et al. J. Biol. Chem., 19:13361 (1992) (hereby incorporated byreference herein in its entirety, including any drawings). Preferably,conserved regions differ by no more than 7 out of 20 nucleotides.

By “unique nucleic acid region” is meant a sequence present in a fulllength nucleic acid coding for a MDK1 polypeptide that is not present ina sequence coding for any other naturally occurring polypeptide. Suchregions preferably comprise 12 or 20 contiguous nucleotides present inthe full length nucleic acid encoding a MDK1 polypeptide.

The invention also features a nucleic acid probe for the detection of aMDK1 polypeptide or nucleic acid encoding a MDK1 polypeptide in asample. The nucleic acid probe contains nucleic acid that will hybridizeto a sequence set forth in SEQ ID NO:1 or a functional derivativethereof.

In preferred embodiments the nucleic acid probe hybridizes to nucleicacid encoding at least 12, 27, 30, 35, 40 or 50 contiguous amino acidsof the full-length sequence set forth in SEQ ID NO:2 or a functionalderivitive thereof. Various low or high stringency hybridizationconditions may be used depending upon the specificity and selectivitydesired. Under stringent hybridization conditions only highlycomplementary nucleic acid sequences hybridize. Preferably, suchconditions prevent hybridization of nucleic acids having 1 or 2mismatches out of 20 contiguous nucleotides.

Methods for using the probes include detecting the presence or amountMDK1 RNA in a sample by contacting the sample with a nucleic acid probeunder conditions such that hybridization occurs and detecting thepresence or amount of the probe bound to MDK1 RNA. The nucleic acidduplex formed between the probe and a nucleic acid sequence coding for aMDK1 polypeptide may be used in the identification of the sequence ofthe nucleic acid detected (for example see, Nelson et al., inNonisotopic DNA Probe Techniques, p. 275 Academic Press, San Diego(Kricka, ed., 1992) hereby incorporated by reference herein in itsentirety, including any drawings). Kits for performing such methods maybe constructed to include a container means having disposed therein anucleic acid probe.

The invention also features recombinant nucleic acid, preferably in acell or an organism. The recombinant nucleic acid may contain a sequenceset forth in SEQ ID NO:1 or a functional derivative thereof and a vectoror a promoter effective to initiate transcription in a host cell. Therecombinant nucleic acid can alternatively contain a transcriptionalinitiation region functional in a cell, a sequence complimentary to anRNA sequence encoding a MDK1 polypeptide and a transcriptionaltermination region functional in a cell.

In another aspect the invention features an isolated, enriched, orpurified MDK1 polypeptide.

By “isolated” in reference to a polypeptide is meant a polymer of 2(preferably 7, more preferably 13, most prefereably 25) or more aminoacids conjugated to each other, including polypeptides that are isolatedfrom a natural source or that are synthesized. The isolated polypeptidesof the present invention are unique in the sense that they are not foundin a pure or separated state in nature. Use of the term “isolated”indicates that a naturally occurring sequence has been removed from itsnormal cellular environment. Thus, the sequence may be in a cell-freesolution or placed in a different cellular environment. The term doesnot imply that the sequence is the only amino acid chain present, butthat it is essentially free (about 90-95% pure at least) of non-aminoacid material naturally associated with it.

By the use of the term “enriched” in reference to a polypeptide is meantthat the specific amino acid sequence constitutes a significantly higherfraction (2-5 fold) of the total of amino acids present in the cells orsolution of interest than in normal or diseased cells or in the cellsfrom which the sequence was taken. This could be caused by a person bypreferential reduction in the amount of other amino acids present, or bya preferential increase in the amount of the specific amino acidsequence of interest, or by a combination of the two. However, it shouldbe noted that enriched does not imply that there are no other amino acidsequences present, just that the relative amount of the sequence ofinterest has been significantly increased. The term significant here isused to indicate that the level of increase is useful to the personmaking such an increase, and generally means an increase relative toother amino acids of about at least 2 fold, more preferably at least 5to 10 fold or even more. The term also does not imply that there is noamino acid from other sources. The other source amino acid may, forexample, comprise amino acid encoded by a yeast or bacterial genome, ora cloning vector such as pUC19. The term is meant to cover only thosesituations in which man has intervened to elevate the proportion of thedesired nucleic acid.

It is also advantageous for some purposes that an amino acid sequence bein purified form. The term “purified” in reference to a polypeptide doesnot require absolute purity (such as a homogeneous preparation);instead, it represents an indication that the sequence is relativelypurer than in the natural environment (compared to the natural levelthis level should be at least 2-5 fold greater, e.g., in terms ofmg/ml). Purification of at least one order of magnitude, preferably twoor three orders, and more preferably four or five orders of magnitude isexpressly contemplated. The substance is preferably free ofcontamination at a functionally significant level, for example 90%, 95%,or 99% pure.

In preferred embodiments the MDK1 polypeptide contains at least 9, 10,15, 20, or 30 contiguous amino acids of the full-length sequence setforth in SEQ ID NO:2, or a functional derivitive thereof.

In yet another aspect the invention features an antibody (e.g., amonoclonal or polyclonal antibody) having specific binding affinity to aMDK1 polypeptide. The antibody contains a sequence of amino acids thatis able to specifically bind to a MDK1 polypeptide. By “specific bindingaffinity” is meant that the antibody binds to MDK1 polypeptides withgreater affinity than it binds to other polypeptides under specifiedconditions.

Antibodies having specific binding affinity to a MDK1 polypeptide may beused in methods for detecting the presence and/or amount of a MDK1polypeptide is a sample by contacting the sample with the antibody underconditions such that an immunocomplex forms and detecting the presenceand/or amount of the antibody conjugated to the MDK1 polypeptide.Diagnostic kits for performing such methods may be constructed toinclude a first container means containing the antibody and a secondcontainer means having a conjugate of a binding partner of the antibodyand a label.

In another aspect the invention features a hybridoma which produces anantibody having specific binding affinity to a MDK1 polypeptide. By“hybridoma” is meant an immortalized cell line which is capable ofsecreting an antibody, for example a MDK1 antibody. In preferredembodiments the MDK1 antibody comprises a sequence of amino acids thatis able to specifically bind a MDK1 polypeptide.

Another aspect of the invention features a method of detecting thepresence or amount of a compound capable of binding to a MDK1polypeptide. The method involves incubating the compound with a MDK1polypeptide and detecting the presence or amount of the compound boundto the MDK1 polypeptide.

Thus, in another aspect, the invention features a method for treatmentof an organism having a disease or condition characterized by anabnormality in a signal transduction pathway, wherein the signaltransduction pathway involves the interaction between a MDK1 receptortyrosine kinase and a MDK1 binding partner. The disorder may also becharacterized by an abnormal level of interaction between MDK1 receptortyrosine kinase and a MDK1 binding partner. The method includesdisrupting or promoting that interaction (or signal) in vivo. The methodalso involves inhibiting or promoting the activity of the complex formedbetween MDK1 receptor tyrosine kinase and a MDK1 binding partner.

By “organism” is meant any living creature. The term includes mammals,and specifically humans. Preferred organisms include mice, as theability to treat or diagnose mice is often predictive of the ability tofunction in other organisms such as humans.

By “disease or condition” is meant a state in an organism, e.g., ahuman, which is recognized as abnormal by members of the medicalcommunity. The disease or condition may be characterized by anabnormality in one or more signal transduction pathways in a cell,preferably a neuronal, fibroblast, epithelial, blood or cancer cell,wherein one of the components of the signal transduction pathway is aMDK1 receptor tyrosine kinase.

Examples of diseases or conditions to be treated or diagnosed by thepresent invention include neurodegenerative disorders,neuroproliferative disorders, cancers, hyperproliferative disorders suchas psoriasis and neurofibromatosis, inflammatory disorders, Alzheimer'sdisease, Parkinson's disease, Lou Gehrig's disease (ALS), trauma,damaged or severed nerve injuries, Huntington's chorea, multiplesclerosis, muscular dystrophy, syringomiplia, Tabes Dorsalis, andcardiovascular accidents. These and other diseases or conditions areoften characterized by one or more of the following symptoms: tumors,astasia, aphasia, paralysis, paresea, and paralagies.

By “abnormality” is meant a level which is statistically different fromthe level observed in organisms not suffering from such a disease orcondition and may be characterized as either an excess amount, intensityor duration of signal or a deficient amount, intensity or duration ofsignal. The abnormality in signal transduction may be realized as anabnormality in neuronal or cancer cell function, viability ordifferentiation state. We have determined that such abnormal interactionin a pathway can be alleviated by action at the MDK1 -binding partnerinteraction site in the pathway.

An abnormal interaction level may also either be greater or less thanthe normal level and may impair the normal performance or function ofthe organism. Thus, it is also possible to screen for agents that willbe useful for treating a disease or condition, characterized by anabnormality in the signal transduction pathway, by testing compounds fortheir ability to affect the interaction between a MDK1 receptor tyrosinekinase and a MDK1 binding partner, since the complex formed by suchinteraction is part of the signal transduction pathway. However, thedisease or condition may be characterized by an abnormality in thesignal transduction pathway even if the level of interaction betweenMDK1 receptor tyrosine kinase and a MDK1 binding partner is normal.

By “interact” is meant any physical association between proteins,whether covalent or non-covalent. Examples of non-covalent bonds includeelectrostatic bonds, hydrogen bonds, and Van der Waals bonds. Stryer,Biochemistry, 1988, pages 7-8. Furthermore, the interactions betweenproteins may either be direct or indirect. Another example of anindirect interaction is the independent production, stimulation, orinhibition of both MDK1 receptor tyrosine kinase and a MDK1 bindingpartner by a regulatory agent. Depending upon the type of interactionpresent, various. methods may be used to measure the level ofinteraction. For example, the strengths of covalent bonds are oftenmeasured in terms of the energy required to break a certain number ofbonds (i.e., kcal/mol) Non-covalent interactions are often described asabove, and also in terms of the distance between the interactingmolecules. Indirect interactions may be described in a number of ways,including the number of intermediary agents involved, or the degree ofcontrol exercised over the MDK1 receptor tyrosine kinase relative to thecontrol exercised over the MDK1 binding partner.

By “MDK1 receptor tyrosine kinase” is meant an amino acid sequencesubstantially similar to the sequence shown in FIG. 1, or fragmentsthereof and is specifically meant to include human equivalents of MDK1.A sequence that is substantially similar will have at least 70% identity(preferably at least 80% and most preferably 90-100%) to the sequence ofFIG. 1 in the ectodomain and at least 85% identity (preferably 90%, mostpreferably 95-100%) in the intracellular domains.

By “identity” is meant a property of sequences that measures theirsimilarity or relationship. Identity is measured by dividing the numberof identical residues by the total number of residues and multiplyingthe product by 100. Thus, two copies of exactly the same sequence have100% identity, but sequences that are less highly conserved and havedeletions, additions, or replacements may have a lower degree ofidentity. MDK1.T1, MDK1.T2, MDK1.Δ1 and MDK1.Δ2 are all examples ofsequences with sufficient identity to the sequence of FIG. 1 to beconsidered a MDK1 receptor tyrosine kinase. Those skilled in the artwill recognize that several computer programs are available fordetermining sequence identity.

By “MDK1 binding partner” is meant an amino acid sequence that interactswith or binds a MDK1 RTK. The term includes ligands and/or substratesfor the MDK1 kinase.

By “disrupt” is meant that the interaction between the MDK1 receptortyrosine kinase and a MDK1 binding partner is reduced either bypreventing expression of the MDK1 receptor tyrosine kinase, or bypreventing expression of the MDK1 binding partner, or by specificallypreventing interaction of the naturally synthesized proteins havingthese domains or by interfering with the interaction of the proteins.

By “promote” is meant that the interaction between a MDK1 receptortyrosine kinase and a MDK1 binding partner is increased either byincreasing expression of a MDK1 receptor tyrosine kinase, or byincreasing expression of a MDK1 binding partner, or by decreasing thedephosphorylating activity of the corresponding regulatory TP (or otherphosphatase acting on other phosphorylated signalling components) bypromoting interaction of the MDK1 receptor tyrosine kinase and a MDK1binding partner or by prolonging the duration of the interaction. Manybivalent or polyvalent linking agents are useful in couplingpolypeptides, such as an antibody, to other molecules. For example,representative coupling agents can include organic compounds such asthioesters, carbodiimides, succinimide esters, diisocyanates,glutaraldehydes, diazobenzenes and hexamethylene diamines. This listingis not intended to be exhaustive of the various classes of couplingagents known in the art but, rather, is exemplary of the more commoncoupling agents. (See Killen and Lindstrom 1984, J. Immunol.133:1335-2549; Jansen, F. K., et al. 1982, Immunological Rev.62:185-216; and Vitetta et al., supra).

By “signal transduction pathway” is meant the sequence of events thatinvolves the transmission of a message from an extracellular protein tothe cytoplasm through a cell membrane. The signal ultimately will causethe cell to perform a particular function, for example, touncontrollably proliferate and therefore cause cancer. Variousmechanisms for the signal transduction pathway (Fry et al., ProteinScience, 2:1785-1797, 1993) provide possible methods for measuring theamount or intensity of a given signal. Depending upon the particulardisease associated with the abnormality in a signal transductionpathway, various symptoms may be detected. Those skilled in the artrecognize those symptoms that are associated with the various otherdiseases described herein. Furthermore, since some adapter moleculesrecruit secondary signal transducer proteins towards the membrane, onemeasure of signal transduction is the concentration and localization ofvarious proteins and complexes. In addition, conformational changes thatare involved in the transmission of a signal may be observed usingcircular dichroism and fluorescence studies.

In a related aspect the invention features a method for screening for anagent useful for treatment of such a disease or condition by assayingpotential agents for the ability to disrupt or promote that interaction.The screening may also involve assaying potential agents for the abilityto remove or reduce the effect of an abnormality in a signaltransduction pathway, wherein the signal transduction pathway contains aMDK1 receptor tyrosine kinase and a MDK1 binding partner.

By “screening” is meant investigating an organism for the presence orabsence of a property. The process may include measuring or detectingvarious properties, including the level of signal transduction and thelevel of interaction between a MDK1 receptor tyrosine kinase and a MDK1binding partner.

Useful agents for treatment of such diseases can be identified bystandard screening protocols in which measurement of such interaction isdetermined. For example, such an agent may be a peptide which eithercomprises, consists of, or consists essentially of a MDK1 receptortyrosine kinase or, alternatively, a fragment thereof.

By “comprising” it is meant including, but not limited to, whateverfollows the word “comprising”. Thus, use of the term “comprising”indicates that the listed elements are required or mandatory, but thatother elements are optional and may or may not be present. By“consisting of” is meant including, and limited to, whatever follows thephrase “consisting of”. Thus, the phrase “consisting of” indicates thatthe listed elements are required or mandatory, and that no otherelements may be present. By “consisting essentially of” is meantincluding any elements listed after the phrase, and limited to otherelements that do not interfere with or contribute to the activity oraction specified in the disclosure for the listed elements. Thus, thephrase “consisting essentially of” indicates that the listed elementsare required or mandatory, but that other elements are optional and mayor may not be present depending upon whether or not they affect theactivity or action of the listed elements.

In preferred embodiments the screening involves looking for agonists orantagonists of a protein of interest, for example MPK1 or a MDK1 bindingpartner. The term agonist refers to agents that bind the protein andthat maintain the activity of the protein to which they bind. Anantagonist competes with the natural ligand for binding the protein, butdoes not maintain the activity of the protein to which it binds.

Another aspect of the invention features a method for diagnosis of sucha disease or condition. The method includes detecting the level ofinteraction between a MDK1 receptor tyrosine kinase and a MDK1 bindingpartner.

By “diagnosis” is meant any method of identifying a symptom normallyassociated with a given disease or condition. Thus, an initial diagnosismay be conclusively established as correct by the use of additionalconfirmatory evidence such as the presence of other symptoms. Currentclassification of various diseases and conditions is constantly changingas more is learned about the mechanisms causing the diseases orconditions. Thus, the detection of an important symptom, such as thedetection of an abnormal level of interaction between the MDK1 receptortyrosine kinases and binding partners for the kinases may form the basisto define and diagnose a newly named disease or condition.

For example, conventional neurological diseases are classified accordingto the presence of a particular set of symptoms. However, a subset ofthese symptoms may both be associated with an abnormality in aparticular signalling pathway, such as the ras²¹ pathway and in thefuture these diseases may be reclassified as ras²¹ pathway diseasesregardless of the particular symptoms observed.

In preferred embodiments the MDK1 receptor tyrosine kinase has conservedcysteine residues in the ectodomain and has FN III domains as shown inFIG. 1, is selected from the group consisting of MDK1.T1, MDK1.T2,MDK1.Δ1 and MDK1.Δ2 as shown in FIG. 2, has a tyrosine residuesubstituted for the phenylanaline residue at position 600, has amolecular weight of 114-120 kD, and has an intracellular domain with amolecular weight of 47 kD. Residues 18-538 defining the extracellulardomain are one example of a fragment, as are other smaller or largersequences. The MDK1 RTK may contain the key amino acids of the catalyticdomain that are highlighted in bold italics in FIG. 1 and have a similarthree dimensional structure to the sequence given in FIG. 1, but mayhave various substitutions, deletions, or additions at non-key residues,as long the sequence still binds the binding partner. In other preferredembodiments the agent is therapeutically effective and has an EC₅₀ orIC₅₀ as described below. An EC₅₀ or IC₅₀ of less than or equal to 5 μMis preferable, and even more preferably less than or equal to 1 μM, 100nmolar, 10 nmolar, or 1 nmolar. Such lower EC₅₀'s or IC₅₀'s areadvantageous since they allow lower concentrations of molecules to beused in vivo or in vitro for therapy or diagnosis. The discovery ofmolecules with such low EC₅₀'s and IC₅₀'s enables the design andsynthesis of additional molecules having similar potency andeffectiveness. In addition, the molecule may have an EC₅₀ or IC₅₀ lessthan or equal to 5 μM at one or more, but not all cells chosen from thegroup consisting of parathyroid cell, bone osteoclast, juxtaglomerularkidney cell, proximal tubule kidney cell, distal tubule kidney cell,cell of the thick ascending limb of Henle's loop and/or collecting duct,central nervous system cell, keratinocyte in the epidermis,parafollicular cell in the thyroid (C-cell), intestinal cell,trophoblast in the placenta, platelet, vascular smooth muscle cell,cardiac atrial cell, gastrin-secreting cell, glucagon-secreting cell,kidney mesangial cell, mammary cell, beta cell, fat/adipose cell, immunecell and GI tract cell.

By “therapeutically effective amount” is meant an amount of apharmaceutical composition having a therapeutically relevant effect. Atherapeutically relevant effect relieves to some extent one or moresymptoms of the disease or condition in the patient; or returns tonormal either partially or completely one or more physiological orbiochemical parameters associated with or causative of the disease orcondition. Generally, a therapeutically effective amount is betweenabout 1 nmole and 1 μmole of the molecule, depending on its EC₅₀ or IC₅₀and on the age and size of the patient, and the disease associated withthe patient.

In a further related aspect, the invention features a method ofidentifying the receptor tyrosine phosphatase responsible fordephosphorylating the activated MDK1 receptor, thereby regulating theMDK1 receptor signaling pathway. Novel methods of treatment of disorders(e.g., neurological disorders) can be based on modulating thisphosphatase activity. Modulation of the RTP activity can be accomplishedin a variety of ways including but not limited to the use of compoundsor drugs that inhibit or enhance the RTP activity, antisense or ribozymeapproaches that “knock out” the RTP activity, or gene therapy approachesto correct defects in the RTP or restore the regulated expression of theRTP. Compounds can be used that specifically modulate the activity ofthe controlling RTP, thereby prolonging or enhancing signal transductionmediated by the MDK1 receptor.

In another aspect the invention features a method for screening forhuman cells containing a MDK1 RTK or an equivalent sequence (i.e., onethat performs a similar function in humans to that played by MDK1 inmice). The method involves identifying the novel RTK in human cellsusing techniques that are routine and standard in the art, such as thosedescribed herein for identifying MDK1 in mouse cells (e.g., cloning,Southern or Northern blot analysis, in situ hybridization, PCRamplification, etc.).

In preferred embodiments the method features screening cells involved inhuman neurological functions, such as nerve cells, for the presence ofMDK1. The invention also features methods of screening human cells forbinding partners of MDK1 RTKs and screening other organisms for MDK1 orthe corresponding binding partner. In other preferred embodiments theagent is therapeutically effective and has an EC₅₀ or IC₅₀ as describedherein.

In other aspects, the invention provides transgenic, nonhuman mammalscontaining a transgene encoding a MDK1 polypeptide or a gene effectingthe expression of a MDK1 polypeptide. Such transgenic nonhuman mammalsare particularly useful as an in vivo test system for studying theeffects of introducing a MDK1 polypeptide, regulating the expression ofa MDK1 polypeptide (i.e., through the introduction of additional genes,antisense nucleic acids, or ribozymes).

A “transgenic animal” is an animal having cells that contain DNA whichhas been artificially inserted into a cell, which DNA becomes part ofthe genome of the animal which develops from that cell. Preferredtransgenic animals are primates, mice, rats, cows, pigs, horses, goats,sheep, dogs and cats. The transgenic DNA may encode for a human MDK1polypeptide. Native expression in an animal may be reduced by providingan amount of anti-sense RNA or DNA effective to reduce expression of thereceptor.

In another aspect, the invention describes a polypeptide comprising arecombinant MDK1 polypeptide or a unique fragment thereof. By “uniquefragment,” is meant an amino acid sequence present in a full-length MDK1polypeptide that is not present in any other naturally occurringpolypeptide. Preferably, such a sequence comprises 6 contiguous aminoacids present in the full sequence. More preferably, such a sequencecomprises 12 contiguous amino acids present in the full sequence. Evenmore preferably, such a sequence comprises 18 contiguous amino acidspresent in the full sequence.

By “recombinant MDK1 polypeptide” is meant to include a polypeptideproduced by recombinant DNA techniques such that it is distinct from anaturally occurring polypeptide either in its location (e.g., present ina different cell or tissue than found in nature), purity or structure.Generally, such a recombinant polypeptide will be present in a cell inan amount different from that normally observed in nature.

In another aspect, the invention describes a recombinant cell or tissuecontaining a purified nucleic acid coding for a MDK1 polypeptide. Insuch cells, the nucleic acid may be under the control of its genomicregulatory elements, or may be under the control of exogenous regulatoryelements including an exogenous promoter. By “exogenous” it is meant apromoter that is not normally coupled in vivo transcriptionally to thecoding sequence for the MDK1 polypeptide.

In another aspect, the invention features a MDK1 polypeptide bindingagent able to bind to a MDK1 polypeptide. The binding agent ispreferably a purified antibody which recognizes an epitope present on aMDK1 polypeptide. Other binding agents include molecules which bind tothe MDK1 polypeptide and analogous molecules which bind to a MDK1polypeptide.

By “purified” in reference to an antibody is meant that the antibody isdistinct from naturally occurring antibody, such as in a purified form.Preferably, the antibody is provided as a homogeneous preparation bystandard techniques. Uses of antibodies to the cloned polypeptideinclude those to be used as therapeutics, or as diagnostic tools.

The summary of the invention described above is non-limiting and otherfeatures and advantages of the invention will be apparent from thefollowing description of the preferred embodiments, and from the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

The nucleotide sequence data reported here have been given accessionnumbers X79082 (MDK1), X79083 (DMK1-T1) and X79084 (MDK1-T2) in theEMBL, GenBank, and DDBJ nucleotide sequence databases.

FIG. 1 shows the nucleotide (as set forth in SEQ ID NO:1) and predictedamino acid sequence (as set forth in SEQ ID NO:2) of a MDK1 RTK. MDK1full-length nucleotide (3628 bp) and deduced amino acid sequence (998amino acids) are shown. The predicted initiating methionine (Kozak, M.,Nucleic Acids Res. 12:857-872, 1984) and signal peptide (Heijne, G. v.,Nucleic Acids Res. 14:4683-4690, 1986) are underlined. Although precededby two putative methionine codons at bases 124 and 226, these codons arefollowed by in-frame stop codons after 4 and 59 amino acids,respectively. In addition, they are surrounded by weak consensussequences for initiation sites, while the proposed initiating methioninecomprises a strong initiation sequence (Kozak, M., Nucleic Acids Res.12:857-872, 1984) preceded by an in-frame stop codon. The putativetransmembrane domain is underlined, the potential N-glycosylation sitesboxed and the conserved extracellular cysteines are circled. Thepolyadenylation motif (AATAAA) (SEQ. I.D. NO. 7) is underlined, thealternative 3′-untranslated region of MDK1 is given below.

FIG. 2 shows the overview of various forms of MDK1 RTKs. FIG. 2A showsthe nucleotide sequence of MDK1.T1 (as set forth in SEQ ID NO:4)beginning with nucleotide 1913 and FIG. 2B shows the nucleotide sequenceof MDK1.T2 (as set forth in SEQ ID NO:6) beginning with nucleotide 1913.The divergent sequence due to alternative splicing is shown underlined,as is the polyadenylation motif (AATAAA) (SEQ ID NO:8) in the sequenceof MDK1.T1.

FIG. 2C shows a schematic representation of MDK1 and its variants. Theopen reading frame is indicated by boxes, the untranslated regions ofthe MDK1 sequences are given in bold lines. Below, the amino acidsequence variations in the marked region of the different forms of MDK1are shown. The missing nucleotide stretches are indicated (---). Thesequences shown each begin at amino acid residue number 535 in MDK1 (SEQID NO:2), MDK1.T1 (SEQ ID NO:3), MDK1.T2 (SEQ ID NO:5), MDK1. Delta 1(SEQ ID NO:11) and MDK1 Delta 2 (SEQ ID NO:12).

FIG. 3 is a dendrogram for the eck/eph subfamily of RTKs. The predictedprotein sequences of MDK1, Hek2 (Böhme et al., 1993), Cek6, 7, 8, 9 and10 (Sajjadi and Pasquale, Oncogene 8:1807-1813, 1993), Elk (Lhotak etal., Mol. Cell. Biol. 11:2496-2502, 1991), Cek5 (Pasquale, E. B., CellRegula. 2:523-534, 1991), Mek4, Cek4 (Sajjadi et al., New Biol.3:769-778, 1991), Hek (Wicks et al., Proc. Natl. Acad. Sci. USA89:1611-1615, 1992.), Ehk1, Ehk2 (Maisonpierre et al., Oncogene8:3277-3288, 1993), Sek (Gilardi-Hebenstreit et al., Oncogene7:2499-2506, 1992), Eek (Chan and Watt, Oncogene 6:1057-1061, 1991), Eck(Lindberg and Hunter, Mol. Cell. Biol. 10:6316-6324, 1990) and eph(Hirai et al., Science 238:1717-1720, 1987) were aligned usingprogressive, pairwise alignments according to the method of Higgins andSharp (Higgins and Sharp, CABIOS 5:151-153, 1989). Published sequencedata for erk (Chan and Watt, Oncogene 6:1057-1061, 1991) and tyrol, 4,5, 6 and 11 (Lai and Lemke, Neuron 6:691-704, 1991) were insufficientfor inclusion in the analysis. A tree of sequence similarity generatedby use of the Unweighted Pair Group Method with Arithmetic meanalgorithm (UPGMA; Sneath and Sokal, in Numerical Taxonomy, W. H. Freemanand Company, San Francisco, 1973, pp. 230-234) calculated on basis ofthe multiple alignment is shown.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present invention relates to MDK1 polypeptides, nucleic acidsencoding such polypeptides, cells, tissues and animals containing suchnucleic acids, antibodies to such polypeptides, assays utilizing suchpolypeptides, and methods relating to all of the foregoing.

The present invention is based upon the isolation of MDK1, a new memberof the eck/eph family of RTKs. MDK1 was identified and shown to beclosely related to Eek, Ehk1/Cek7, Ehk2, Cek4/Mek4/hek, and Sek/Cek8subfamily. cDNA cloning using adult mouse brains and Northern blotanalysis revealed MDK1 mRNA transcripts of 6.8, 5.7, 4.0, 3.2, and 2.6kb that encode apparent splice variants and indicate that MDK1 isexpressed in at least five variant forms, including two full-lengthreceptors that display short amino acid deletions in their extra- andintracellular domains. The existence of multiple receptor forms withshort insertion sequences in their juxtamembrane regions and N-terminalof the transmembrane domain has already been described for Ehk-1(Maisonpierre et al., Oncogene 8:3277-3288, 1993), Cek5, Cek6, and Cek10(Sajjadi and Pasquale, Oncogene 8:1807-1813, 1993), other members of thelarge eck/eph RTK subfamily. Further examination of other RTKs of thisfamily may also reveal the generation of variant mRNAs, as suggested bythe identification of a testes-specific transcript for Mek4 (Sajjadi etal., New Biol. 3:769-778, 1991). Northern blot and in situ hybridizationanalysis in the adult mouse indicated that RNA expression is restrictedto brain, testes, and spleen. The distinct patterns of MDK1 expressionduring mouse development suggest an important role for MDK1 in theformation of neuronal structures.

The physiological role of the amino acid deletions in MDK1Δ1 and MDK1Δ2is currently unclear. However, the amino acid exchange of amino acid 600from phenylalanine to cysteine in MDK1Δ2 changes the motif Y⁵⁹⁷FHF toY⁵⁹⁷FHC, which since the first three amino acids following aphosphotyrosine define the binding specificity of src-homology 2(SH2)domain-containing proteins to activated RTKs (Songyang et al., Cell72:767-778, 1993), may result in redefinition of the MDK1 signal andthereby the response of the cell.

The cDNA cloning and Northern blot analysis demonstrate the existence oftwo truncated versions of MDK1 that possess the entire ectodomain, thetransmembrane domain, and part of the juxtamembrane region, but lack thecatalytic tyrosine kinase domain. Similarly, for the closely related RTKMEK4, a putative secreted form has been reported consisting of theectodomain only, although the expression of a corresponding transcriptin any of the tissues examined could not be demonstrated (Sajjadi etal., New Biol. 3:769-778, 1991). Analogous truncated forms of RTKs thatlack the catalytic domain but are still anchored in the cell membranehave been described for trkB (Klein et al., Cell 61:647-656, 1990;Middlemas et al., Mol. Cell. Biol. 11:143-153, 1991), trkC (Valenzuelaet al., Neuron 10:963-974, 1993), the heparin-binding fibroblast growthfactor (HBGF) receptor (Hou et al., Science 251:665-668, 1991), and ltk(Toyoshima et al., Proc. Natl. Acad. Sci. U.S.A. 90:5404-5408, 1993). Inthe case of trkB, expression of this isoform is restricted to theependymal linings of cerebral ventricles and choroid plexus structuresof the mouse forebrain, indicative of a putative role in ligandtransport across the blood-brain barrier (Klein et al., Cell 61:647-656,1990).

While similar findings have been reported for the truncated forms oftrkC (Valenzuela et al., Neuron 10:963-974, 1993), additionalinterpretations of the physiological role of MDK1.T1 and MDK1.T2 aresuggested by their divergence in C-terminal sequences, which mayinteract differentially with cytoplasmic proteins involved in MDK1signal transduction. As suggested by experiments with artificiallytruncated receptors for epidermal growth factor receptor (EGF-R),fibroblast growth factor, and vascular endothelial growth factor, suchmutants may impair or modulate signal transduction by the nativereceptors.

Since RTKs are thought to function as dimers, the formation ofheterodimers between a normal and a truncated mutant receptor preventstransphosphorylation and therefore suppresses the activation of receptorsignaling functions (Kashles et al., Mol. Cell. Biol. 11:1454-1463,1991; Redemann et al., Mol. Cell. Biol. 12:491-498, 1992). Such amodulatory function through dominant negative inhibition for MDK1-T1 andT2 would require expression of dominant negative truncated and thefull-length receptors within the same cell. To address this potentiallyimportant aspect of MDK1 function, the spatial and temporal expressionof MDK1.T1 and MDK1.T2 in comparison with the full-length forms duringmouse development can be investigated.

MDK1 RTKs show autophosphorylation after inhibition of cellularphosphotyrosine phosphatases. Its predicted protein sequence possessesall of the important amino acids conserved in the catalytic domain oftyrosine kinases (Hanks et al., Science 241:42-52, 1988). MDK1 RTKsmigrate as a protein doublet of an apparent molecular weight ofapproximately 114 kD and 120 kD, the larger band corresponding to themajor, glycosylated protein. The lower 114 kD precursor form of MDK1RTKs is probably due to overloading of the processing enzymes in thetransient overexpression system used. The observed sizes are similar tothe apparent molecular weights of 120 kD reported for Cek5 (Pasquale, E.B., Cell Regula. 2:523-534, 1991), 130 kD for eph (Maru et al., Oncogene5:445-447, 1990), and 130 kD for elk (Lhotak et al., Mol. Cell. Biol.11:2496-2502, 1991). Eck has been described to migrate as a doublet of125 kD and 130 kD (Lindberg and Hunter, Mol. Cell. Biol. 10:6316-6324,1990).

A prominent protein of an apparent molecular weight of 47 kD found inall immunoprecipitations with antibodies directed against either theC-terminal amino acids of MDK1 RTKs or phosphotyrosines is believed tocorrespond to the intracellular domain of MDK1 RTKs. It is only found incells transfected with receptor DNA and is not a substrate of MDK1.Protein bands of approximately 60 kD and 53 kD have been detected inimmunoprecipitations of eck (Lindberg and Hunter, Mol. Cell. Biol.10:6316-6324, 1990) and elk (Lhotak et al., Mol. Cell. Biol.11:2496-2502, 1991), respectively.

The situation resembles that found for the colony-stimulating factor 1receptor (CSF1-R). CSF1-R is downregulated through two entirelydifferent mechanisms, one of which makes use of a protein kinase C (PKC)activated protease. The action of this protease results in an inducibleproteolytic cleavage of the CSF1-R near the transmembrane domain,releasing an intracellular fragment containing the kinase domain(Downing et al., Mol. Cell. Biol. 9:2890-2896, 1989). Such a specificproteolytic action on RTKs could downmodulate the activity of thekinases. If such a regulation of MDK1 exists it may be important underphysiological conditions.

As revealed through Northern blot and in situ hybridization analysis,MDK1 displays a rather widespread expression pattern in the early mouseembryo. There is, however, an increasing restriction of MDK1transcription during embryogenesis, resulting in a predominantexpression in the brain of adult mice. Based on its expression sites,MDK1 is likely to be involved in the establishment of the complexneuronal organization of the nervous system, as suggested by itsexpression in key structures of the central nervous system. MDK1 isfound throughout the development of the hippocampal formation and inthalamic structures such as the mammillary body of the hypothalamus andthe habenula of the epithalamus, which are important components of thelimbic system of the CNS. This system is associated with emotionalaspects of behavior related to the survival of the animal and thespecies, together with visceral responses accompanying these emotions.Additionally, the limbic system is thought to participate in theprocesses involved in memory formation (Rohen, J. W., FunktionelleAnatomie des Nervensystems, Schattauer, New York, 1985).

The expression of MDK1 in various sense organs like the ear, the tongueor the vibrissae and in neuronal structures involved in the processingof sensoric signals like the superior colliculus or the trigeminal (V)ganglion could point to an important function of MDK1 in thedifferentiation of the limbic system, since it is connected to nearlyall sensoric organs. Apart from the putative role of MDK1 in theformation of the limbic system, there appears to be a connection betweenthe expression of MDK1 in the Purkinje cell layer of the cerebellum andin the inferior olive of the medulla, since, at least in mammals, thisstructure is the only source of climbing fibers connecting to thePurkinje cells (Ito, M, The cerebellum and neuronal control, RavenPress, New York, 1984).

MDK1 is a new member of a growing subgroup of eck/eph-like RTKs, whichare more related to each other than to any of the remaining eck/eph-likekinases (FIG. 3). In the mouse, the members of this subgroup, MDK1, eek,ehk-1, ehk-2, Mek4, Cek4, Hek, Sek, Cek7, and Cek8, are expressedprimarily in the brain, with Cek7 (Sajjadi and Pasquale, Oncogene8:1807-1813, 1993) and eek (Chan and Watt, Oncogene 6:1057-1061, 1991)being exclusively expressed in this region. The other kinases arereported to have additional regions of transcription in nonneuronaltissues. Transcripts of Mek4 have been detected at low level in testes(Sajjadi et al., New Biol. 3:769-778, 1991). Ehk1 is found faintly inovary and skin, whereas Ehk2 shows weak signals in skin, skeletalmuscle, spleen and thymus (Maisonpierre et al., Oncogene 8:3277-3288,1993). Sek was found to be expressed in specific compartments of thedeveloping hindbrain and in pre-somitic mesoderm and shows supplementaryexpression in heart, lung and kidney (Gilardi-Hebenstreit et al.,Oncogene 7:2499-2506, 1992; Nieto et al., Development 116:1137-1150,1992). Cek8 expression is also detectable in kidney, lung, skeletalmuscle, and thymus (Sajjadi and Pasquale, Oncogene 8:1807-1813, 1993).

The existence of a sub-branch within the eph/eck-family which may haveevolved in parallel with the elaboration of diverse cell types in thevertebrate nervous system has already been suggested by Maisonpierre etal. (Maisonpierre et al., Oncogene 8:3277-3288, 1993). We believe thatthis subgroup is likely to grow as further RTKs are identified and willprove to be a diverse family of related kinases involved in theregulation of the development of the central and peripheral nervoussystems.

I. Nucleic Acid Encoding A MDK1 Polypeptide

Included within the scope of this invention are the functionalequivalents of the herein-described isolated nucleic acid molecules. Thedegeneracy of the genetic code permits substitution of certain codons byother codons which specify the same amino acid and hence would give riseto the same protein. The nucleic acid sequence can vary substantiallysince, with the exception of methionine and tryptophan, the known aminoacids can be coded for by more than one codon. Thus, portions or all ofthe MDK1 gene could be synthesized to give a nucleic acid sequencesignificantly different from that shown in SEQ ID NO: 1. The encodedamino acid sequence thereof would, however, be preserved.

In addition, the nucleic acid sequence may comprise a nucleotidesequence which results from the addition, deletion or substitution of atleast one nucleotide to the 5′-end and/or the 3′-end of the nucleic acidformula shown in SEQ ID NO: 1 or a derivative thereof. Any nucleotide orpolynucleotide may be used in this regard, provided that its addition,deletion or substitution does not alter the amino acid sequence of SEQID NO:2 which is encoded by the nucleotide sequence. For example, thepresent invention is intended to include any nucleic acid sequenceresulting from the addition of ATG as an initiation codon at the 5′-endof the inventive nucleic acid sequence or its derivative, or from theaddition of TTA, TAG or TGA as a termination codon at the 3′-end of theinventive nucleotide sequence or its derivative. Moreover, the nucleicacid molecule of the present invention may, as necessary, haverestriction endonuclease recognition sites added to its 5′-end and/or3′-end.

Such functional alterations of a given nucleic acid sequence afford anopportunity to promote secretion and/or processing of heterologousproteins encoded by foreign nucleic acid sequences fused thereto. Allvariations of the nucleotide sequence of the MDK1 genes and fragmentsthereof permitted by the genetic code are, therefore, included in thisinvention.

Further, it is possible to delete codons or to substitute one or morecodons by codons other than degenerate codons to produce a structurallymodified polypeptide, but one which has substantially the same utilityor activity of the polypeptide produced by the unmodified nucleic acidmolecule. As recognized in the art, the two polypeptides arefunctionally equivalent, as are the two nucleic acid molecules whichgive rise to their production, even though the differences between thenucleic acid molecules are not related to degeneracy of the geneticcode.

II. A Nucleic Acid Probe for the Detection of MDK1

A nucleic acid probe of the present invention may be used to probe anappropriate chromosomal or cDNA library by usual hybridization methodsto obtain another nucleic acid molecule of the present invention. Achromosomal DNA or cDNA library may be prepared from appropriate cellsaccording to recognized methods in the art (cf. Molecular Cloning: ALaboratory Manual, second edition, edited by Sambrook, Fritsch, &Maniatis, Cold Spring Harbor Laboratory, 1989).

In the alternative, chemical synthesis is carried out in order to obtainnucleic acid probes having nucleotide sequences which correspond toN-terminal and C-terminal portions of the amino acid sequence of thepolypeptide of interest. Thus, the synthesized nucleic acid probes maybe used as primers in a polymerase chain reaction (PCR) carried out inaccordance with recognized PCR techniques, essentially according to PCRProtocols, A Guide to Methods and Applications, edited by Michael etal., Academic Press, 1990, utilizing the appropriate chromosomal or cDNAlibrary to obtain the fragment of the present invention.

One skilled in the art can readily design such probes based on thesequence disclosed herein using methods of computer alignment andsequence analysis known in the art (cf. Molecular Cloning: A LaboratoryManual, second edition, edited by Sambrook, Fritsch, & Maniatis, ColdSpring Harbor Laboratory, 1989). The hybridization probes of the presentinvention can be labeled by standard labeling techniques such as with aradiolabel, enzyme label, fluorescent label, biotin-avidin label,chemiluminescence, and the like. After hybridization, the probes may bevisualized using known methods.

The nucleic acid probes of the present invention include RNA, as well asDNA probes, such probes being generated using techniques known in theart. The nucleic acid probe may be immobilized on a solid support.Examples of such solid supports include, but are not limited to,plastics such as polycarbonate, complex carbohydrates such as agaroseand sepharose, and acrylic resins, such as polyacrylamide and latexbeads. Techniques for coupling nucleic acid probes to such solidsupports are well known in the art.

The test samples suitable for nucleic acid probing methods of thepresent invention include, for example, cells or nucleic acid extractsof cells, or biological fluids. The sample used in the above-describedmethods will vary based on the assay format, the detection method andthe nature of the tissues, cells or extracts to be assayed. Methods forpreparing nucleic acid extracts of cells are well known in the art andcan be readily adapted in order to obtain a sample which is compatiblewith the method utilized.

III. A Probe Based Method And Kit For Detectinq MDK1

One method of detecting the presence of MDK1 in a sample comprises a)contacting said sample with the above-described nucleic acid probe,under conditions such that hybridization occurs, and b) detecting thepresence of said probe bound to said nucleic acid molecule. One skilledin the art would select the nucleic acid probe according to techniquesknown in the an as described above. Samples to be tested include butshould not be limited to RNA samples of human tissue.

A kit for detecting the presence of MDK1 in a sample comprises at leastone container means having disposed therein the above-described nucleicacid probe. The kit may further comprise other containers comprising oneor more of the following: wash reagents and reagents capable ofdetecting the presence of bound nucleic acid probe. Examples ofdetection reagents include, but are not limited to radiolabelled probes,enzymatic labeled probes (horse radish peroxidase, alkalinephosphatase), and affinity labeled probes (biotin, avidin, orsteptavidin).

In detail, a compartmentalized kit includes any kit in which reagentsare contained in separate containers. Such containers include smallglass containers, plastic containers or strips of plastic or paper. Suchcontainers allow the efficient transfer of reagents from one compartmentto another compartment such that the samples and reagents are notcross-contaminated and the agents or solutions of each container can beadded in a quantitative fashion from one compartment to another. Suchcontainers will include a container which will accept the test sample, acontainer which contains the probe or primers used in the assay,containers which contain wash reagents (such as phosphate bufferedsaline, Tris-buffers, and the like), and containers which contain thereagents used to detect the hybridized probe, bound antibody, amplifiedproduct, or the like. One skilled in the art will readily recognize thatthe nucleic acid probes described in the present invention can readilybe incorporated into one of the established kit formats which are wellknown in the art.

IV. DNA Constructs Comprising a MDK1 Nucleic Acid Molecule and CellsContaining These Constructs

The present invention also relates to a recombinant DNA moleculecomprising, 5′ to 3′, a promoter effective to initiate transcription ina host cell and the above-described nucleic acid molecules. In addition,the present invention relates to a recombinant DNA molecule comprising avector and an above-described nucleic acid molecules. The presentinvention also relates to a nucleic acid molecule comprising atranscriptional region functional in a cell, a sequence complimentary toan RNA sequence encoding an amino acid sequence corresponding to theabove-described polypeptide, and a transcriptional termination regionfunctional in said cell. The above-described molecules may be isolatedand/or purified DNA molecules.

The present invention also relates to a cell or organism that containsan above-described nucleic acid molecule. The peptide may be purifiedfrom cells which have been altered to express the peptide. A cell issaid to be “altered to express a desired peptide” when the cell, throughgenetic manipulation, is made to produce a protein which it normallydoes not produce or which the cell normally produces at lower levels.One skilled in the art can readily adapt procedures for introducing andexpressing either genomic, cDNA, or synthetic sequences into eithereukaryotic or prokaryotic cells.

A nucleic acid molecule, such as DNA, is said to be “capable ofexpressing” a polypeptide if it contains nucleotide sequences whichcontain transcriptional and translational regulatory information andsuch sequences are “operably linked” to nucleotide sequences whichencode the polypeptide. An operable linkage is a linkage in which theregulatory DNA sequences and the DNA sequence sought to be expressed areconnected in such a way as to permit gene sequence expression. Theprecise nature of the regulatory regions needed for gene sequenceexpression may vary from organism to organism, but shall in generalinclude a promoter region which, in prokaryotes, contains both thepromoter (which directs the initiation of RNA transcription) as well asthe DNA sequences which, when transcribed into RNA, will signalsynthesis initiation. Such regions will normally include those5′-non-coding sequences involved with initiation of transcription andtranslation, such as the TATA box, capping sequence, CAAT sequence, andthe like.

If desired, the non-coding region 3′ to the sequence encoding an MDK1gene may be obtained by the above-described methods. This region may beretained for its transcriptional termination regulatory sequences, suchas termination and polyadenylation. Thus, by retaining the 3′-regionnaturally contiguous to the DNA sequence encoding an MDK1 gene, thetranscriptional termination signals may be provided. Where thetranscriptional termination signals are not satisfactorily functional inthe expression host cell, then a 3′ region functional in the host cellmay be substituted.

Two DNA sequences (such as a promoter region sequence and an MDK1sequence) are said to be operably linked if the nature of the linkagebetween the two DNA sequences does not (1) result in the introduction ofa frame-shift mutation, (2) interfere with the ability of the promoterregion sequence to direct the transcription of an MDK1 gene sequence, or(3) interfere with the ability of the an MDK1 gene sequence to betranscribed by the promoter region sequence. Thus, a promoter regionwould be operably linked to a DNA sequence if the promoter were capableof effecting transcription of that DNA sequence. Thus, to express anMDK1 gene, transcriptional and translational signals recognized by anappropriate host are necessary.

The present invention encompasses the expression of the MDK1 gene (or afunctional derivative thereof) in either prokaryotic or eukaryoticcells. Prokaryotic hosts are, generally, very efficient and convenientfor the production of recombinant proteins and are, therefore, one typeof preferred expression system for the MDK1 gene. Prokaryotes mostfrequently are represented by various strains of E. coli. However, othermicrobial strains may also be used, including other bacterial strains.

In prokaryotic systems, plasmid vectors that contain replication sitesand control sequences derived from a species compatible with the hostmay be used. Examples of suitable plasmid vectors may include pBR322,pUC118, pUC119 and the like; suitable phage or bacteriophage vectors mayinclude γgt10, γgt11 and the like; and suitable virus vectors mayinclude pMAM-neo, pKRC and the like. Preferably, the selected vector ofthe present invention has the capacity to replicate in the selected hostcell.

Recognized prokaryotic hosts include bacteria such as E. coil, Bacillus,Streptomyces, Pseudomonas, Salmonella, Serratia, and the like. However,under such conditions, the peptide will not be glycosylated. Theprokaryotic host must be compatible with the replicon and controlsequences in the expression plasmid.

To express MDK1 (or a functional derivative thereof) in a prokaryoticcell, it is necessary to operably link the MDK1 sequence to a functionalprokaryotic promoter. Such promoters may be either constitutive or, morepreferably, regulatable (i.e., inducible or derepressible). Examples ofconstitutive promoters include the int promoter of bacteriophage λ, thebla promoter of the β-lactamase gene sequence of pBR322, and the CATpromoter of the chloramphenicol acetyl transferase gene sequence ofpPR325, and the like. Examples of inducible prokaryotic promotersinclude the major right and left promoters of bacteriophage λ (P_(L) andP_(R)), the trp, recA, lacZ, lacI, and gal promoters of E. coli, theα-amylase (Ulmanen et at., J. Bacteriol. 162:176-182(1985)) and theζ-28-specific promoters of B. subtilis (Gilman et at., Gene sequence32:11-20(1984)), the promoters of the bacteriophages of Bacillus(Gryczan, In: The Molecular Biology of the Bacilli, Academic Press,Inc., N.Y. (1982)), and Streptomyces promoters (Ward et at., Mol. Gen.Genet. 203:468-478(1986)). Prokaryotic promoters are reviewed by Glick(J. Ind. Microbiot. 1:277-282(1987)); Cenatiempo (Biochimie68:505-516(1986)); and Gottesman (Ann. Rev. Genet. 18:415-442(1984)).

Proper expression in a prokaryotic cell also requires the presence of aribosome binding site upstream of the gene sequence-encoding sequence.Such ribosome binding sites are disclosed, for example, by Gold et at.(Ann. Rev. Microbiol. 35:365-404(1981)). The selection of controlsequences, expression vectors, transformation methods, and the like, aredependent on the type of host cell used to express the gene. As usedherein, “cell”, “cell line”, and “cell culture” may be usedinterchangeably and all such designations include progeny. Thus, thewords “transformants” or “transformed cells” include the primary subjectcell and cultures derived therefrom, without regard to the number oftransfers. It is also understood that all progeny may not be preciselyidentical in DNA content, due to deliberate or inadvertent mutations.However, as defined, mutant progeny have the same functionality as thatof the originally transformed cell.

Host cells which may be used in the expression systems of the presentinvention are not strictly limited, provided that they are suitable foruse in the expression of the MDK1 peptide of interest. Suitable hostsmay often include eukaryotic cells. Preferred eukaryotic hosts include,for example, yeast, fungi, insect cells, mammalian cells either in vivo,or in tissue culture. Mammalian cells which may be useful as hostsinclude HeLa cells, cells of fibroblast origin such as VERO or CHO-K1,or cells of lymphoid origin and their derivatives. Preferred mammalianhost cells include SP2/0 and J558L, as well as neuroblastoma cell linessuch as IMR 332 which may provide better capacities for correctpost-translational processing.

In addition, plant cells are also available as hosts, and controlsequences compatible with plant cells are available, such as thecauliflower mosaic virus 35S and 19S, and nopaline synthase promoter andpolyadenylation signal sequences. Another preferred host is an insectcell, for example the Drosophila larvae. Using insect cells as hosts,the Drosophila alcohol dehydrogenase promoter can be used. Rubin,Science 240:1453-1459(1988). Alternatively, baculovirus vectors can beengineered to express large amounts of MDK1 in insects cells (Jasny,Science 238:1653(1987); Miller et al., In: Genetic Engineering (1986),Setlow, J. K., et al., eds., Plenum, Vol. 8, pp. 277-297).

Any of a series of yeast gene sequence expression systems can beutilized which incorporate promoter and termination elements from theactively expressed gene sequences coding for glycolytic enzymes areproduced in large quantities when yeast are grown in mediums rich inglucose. Known glycolytic gene sequences can also provide very efficienttranscriptional control signals. Yeast provides substantial advantagesin that it can also carry out post-translational peptide modifications.A number of recombinant DNA strategies exist which utilize strongpromoter sequences and high copy number of plasmids which can beutilized for production of the desired proteins in yeast. Yeastrecognizes leader sequences on cloned mammalian gene sequence productsand secretes peptides bearing leader sequences (i.e., pre-peptides). Fora mammalian host, several possible vector systems are available for theexpression of MDK1.

A wide variety of transcriptional and translational regulatory sequencesmay be employed, depending upon the nature of the host. Thetranscriptional and translational regulatory signals may be derived fromviral sources, such as adenovirus, bovine papilloma virus,cytomegalovirus, simian virus, or the like, where the regulatory signalsare associated with a particular gene sequence which has a high level ofexpression. Alternatively, promoters from mammalian expression products,such as actin, collagen, myosin, and the like, may be employed.Transcriptional initiation regulatory signals may be selected whichallow for repression or activation, so that expression of the genesequences can be modulated. Of interest are regulatory signals which aretemperature-sensitive so that by varying the temperature, expression canbe repressed or initiated, or are subject to chemical (such asmetabolite) regulation.

Expression of MDK1 in eukaryotic hosts requires the use of eukaryoticregulatory regions. Such regions will, in general, include a promoterregion sufficient to direct the initiation of RNA synthesis. Preferredeukaryotic promoters include, for example, the promoter of the mousemetallothionein I gene sequence (Hamer et al., J. Mol. Appl. Gen.1:273-288(1982)); the TK promoter of Herpes virus (McKnight, Cell31:355-365(1982)); the SV40 early promoter (Benoist et al., Nature(London) 290:304-310(1981)); the yeast gal4 gene sequence promoter(Johnston et al., Proc. Natl. Acad. Sci. (U.S.A.) 79:6971-6975(1982);Silver et al., Proc. Natl. Acad. Sci. (U.S.A.) 81:5951-5955(1984)).

Translation of eukaryotic mRNA is initiated at the codon which encodesthe first methionine. For this reason, it is preferable to ensure thatthe linkage between a eukaryotic promoter and a DNA sequence whichencodes MDK1 (or a functional derivative thereof) does not contain anyintervening codons which are capable of encoding a methionine (i.e.,AUG). The presence of such codons results either in a formation of afusion protein (if the AUG codon is in the same reading frame as theMDK1 coding sequence) or a frame-shift mutation (if the AUG codon is notin the same reading frame as the MDK1 coding sequence).

A MDK1 nucleic acid molecule and an operably linked promoter may beintroduced into a recipient prokaryotic or eukaryotic cell either as anonreplicating DNA (or RNA) molecule, which may either be a linearmolecule or, more preferably, a closed covalent circular molecule. Sincesuch molecules are incapable of autonomous replication, the expressionof the gene may occur through the transient expression of the introducedsequence. Alternatively, permanent expression may occur through theintegration of the introduced DNA sequence into the host chromosome.

A vector may be employed which is capable of integrating the desiredgene sequences into the host cell chromosome. Cells which have stablyintegrated the introduced DNA into their chromosomes can be selected byalso introducing one or more markers which allow for selection of hostcells which contain the expression vector. The marker may provide forprototrophy to an auxotrophic host, biocide resistance, e.g.,antibiotics, or heavy metals, such as copper, or the like. Theselectable marker gene sequence can either be directly linked to the DNAgene sequences to be expressed, or introduced into the same cell byco-transfection. Additional elements may also be needed for optimalsynthesis of single chain binding protein mRNA. These elements mayinclude splice signals, as well as transcription promoters, enhancers,and termination signals. cDNA expression vectors incorporating suchelements include those described by Okayama, Molec. Cell. Biol.3:280(1983).

The introduced nucleic acid molecule can be incorporated into a plasmidor viral vector capable of autonomous replication in the recipient host.Any of a wide variety of vectors may be employed for this purpose.Factors of importance in selecting a particular plasmid or viral vectorinclude: the ease with which recipient cells that contain the vector maybe recognized and selected from those recipient cells which do notcontain the vector; the number of copies of the vector which are desiredin a particular host; and whether it is desirable to be able to“shuttle” the vector between host cells of different species. Preferredprokaryotic vectors include plasmids such as those capable ofreplication in E. coli (such as, for example, pBR322, ColEl, pSC101,pACYC 184, πVX. Such plasmids are, for example, disclosed by Sambrook(cf. Molecular Cloning: A Laboratory Manual, second edition, edited bySambrook, Fritsch, & Maniatis, Cold Spring Harbor Laboratory, (1989)).Bacillus plasmids include pC194, pC221, pT127, and the like. Suchplasmids are disclosed by Gryczan (In: The Molecular Biology of theBacilli, Academic Press, NY (1982), pp. 307-329). Suitable Streptomycesplasmids include p1J101(Kendall et al., J. Bacteriol.169:4177-4183(1987)), and streptomyces bacteriophages such asøC31(Chater et al., In: Sixth International Symposium on ActinomycetalesBiology, Akademiai Kaido, Budapest, Hungary (1986), pp. 45-54).Pseudomonas plasmids are reviewed by John et al. (Rev. Infect. Dis.8:693-704(1986)), and Izaki (Jpn. J. Bacteriol. 33:729-742(1978)).

Preferred eukaryotic plasmids include, for example, BPV, vaccinia, SV40,2-micron circle, and the like, or their derivatives. Such plasmids arewell known in the art (Botstein et al., Miami Wntr. Symp.19:265-274(1982); Broach, In: The Molecular Biology of the YeastSaccharomyces: Life Cycle and Inheritance, Cold Spring HarborLaboratory, Cold Spring Harbor, N.Y., p. 445-470 (1981); Broach, Cell28:203-204(1982); Bollon et at., J. Ctin. Hematol. Oncol.10:39-48(1980); Maniatis, In: Cell Biology: A Comprehensive Treatise,Vol. 3, Gene Sequence Expression, Academic Press, N.Y., pp.563-608(1980).

Once the vector or nucleic acid molecule containing the construct(s) hasbeen prepared for expression, the DNA construct(s) may be introducedinto an appropriate host cell by any of a variety of suitable means,i.e., transformation, transfection, conjugation, protoplast fusion,electroporation, particle gun technology, calciumphosphate-precipitation, direct microinjection, and the like. After theintroduction of the vector, recipient cells are grown in a selectivemedium, which selects for the growth of vector-containing cells.Expression of the cloned gene molecule(s) results in the production ofMDK1 or fragments thereof. This can take place in the transformed cellsas such, or following the induction of these cells to differentiate (forexample, by administration of bromodeoxyuracil to neuroblastoma cells orthe like). A variety of incubation conditions can be used to form thepeptide of the present invention. The most preferred conditions arethose which mimic physiological conditions.

V. Purified MDK1 Polypeptides

A variety of methodologies known in the art can be utilized to obtainthe peptide of the present invention. The peptide may be purified fromtissues or cells which naturally produce the peptide. Alternatively, theabove-described isolated nucleic acid fragments could be used toexpressed the MDK1 protein in any organism. The samples of the presentinvention include cells, protein extracts or membrane extracts of cells,or biological fluids. The sample will vary based on the assay format,the detection method and the nature of the tissues, cells or extractsused as the sample.

Any eukaryotic organism can be used as a source for the peptide of theinvention, as long as the source organism naturally contains such apeptide. As used herein, “source organism” refers to the originalorganism from which the amino acid sequence of the subunit is derived,regardless of the organism the subunit is expressed in and ultimatelyisolated from.

One skilled in the art can readily follow known methods for isolatingproteins in order to obtain the peptide free of natural contaminants.These include, but are not limited to: size-exclusion chromatography,HPLC, ion-exchange chromatography, and immuno-affinity chromatography.

VI. An Antibody Having Binding Affinity To A MDK1 Polypeptide and aHybridoma Containing the Antibody

The present invention relates to an antibody having binding affinity toa MDK1 polypeptide. The polypeptide may have the amino acid sequence setforth in SEQ ID NO:2, or functional derivitive thereof, or at least 9contiguous amino acids thereof (preferably, at least 10, 15, 20, or 30contiguous amino acids thereof).

The present invention also relates to an antibody having specificbinding affinity to an MDK1 polypeptide. Such an antibody may beisolated by comparing its binding affinity to a MDK1 polypeptide withits binding affinity to another polypeptide. Those which bindselectively to MDK1 would be chosen for use in methods requiring adistinction between MDK1 and other polypeptides. Such methods couldinclude, but should not be limited to, the analysis of altered MDK1expression in tissue containing other polypeptides such as FAK.

The MDK1 proteins of the present invention can be used in a variety ofprocedures and methods, such as for the generation of antibodies, foruse in identifying pharmaceutical compositions, and for studyingDNA/protein interaction.

The MDK1 peptide of the present invention can be used to produceantibodies or hybridomas. One skilled in the art will recognize that ifan antibody is desired, such a peptide would be generated as describedherein and used as an immunogen. The antibodies of the present inventioninclude monoclonal and polyclonal antibodies, as well fragments of theseantibodies, and humanized forms. Humanized forms of the antibodies ofthe present invention may be generated using one of the procedures knownin the art such as chimerization or CDR grafting. The present inventionalso relates to a hybridoma which produces the above-describedmonoclonal antibody, or binding fragment thereof. A hybridoma is animmortalized cell line which is capable of secreting a specificmonoclonal antibody.

In general, techniques for preparing monoclonal antibodies andhybridomas are well known in the art (Campbell, “Monoclonal AntibodyTechnology: Laboratory Techniques in Biochemistry and MolecularBiology,” Elsevier Science Publishers, Amsterdam, The Netherlands(1984); St. Groth et al., J. Immunol. Methods 35:1-21(1980)). Any animal(mouse, rabbit, and the like) which is known to produce antibodies canbe immunized with the selected polypeptide. Methods for immunization arewell known in the art. Such methods include subcutaneous orintraperitoneal injection of the polypeptide. One skilled in the artwill recognize that the amount of polypeptide used for immunization willvary based on the animal which is immunized, the antigenicity of thepolypeptide and the site of injection.

The polypeptide may be modified or administered in an adjuvant in orderto increase the peptide antigenicity. Methods of increasing theantigenicity of a polypeptide are well known in the art. Such proceduresinclude coupling the antigen with a heterologous protein (such asglobulin or β-galactosidase) or through the inclusion of an adjuvantduring immunization.

For monoclonal antibodies, spleen cells from the immunized animals areremoved, fused with myeloma cells, such as SP2/0-Ag14 myeloma cells, andallowed to become monoclonal antibody producing hybridoma cells. Any oneof a number of methods well known in the art can be used to identify thehybridoma cell which produces an antibody with the desiredcharacteristics. These include screening the hybridomas with an ELISAassay, western blot analysis, or radioimmunoassay (Lutz et al., Exp.Cell Res. 175:109-124(1988)). Hybridomas secreting the desiredantibodies are cloned and the class and subclass is determined usingprocedures known in the art (Campbell, Monoclonal Antibody Technology:Laboratory Techniques in Biochemistry and Molecular Biology, supra(1984)).

For polyclonal antibodies, antibody containing antisera is isolated fromthe immunized animal and is screened for the presence of antibodies withthe desired specificity using one of the above-described procedures. Theabove-described antibodies may be detectably labeled. Antibodies can bedetectably labeled through the use of radioisotopes, affinity labels(such as biotin, avidin, and the like), enzymatic labels (such as horseradish peroxidase, alkaline phosphatase, and the like) fluorescentlabels (such as FITC or rhodamine, and the like), paramagnetic atoms,and the like. Procedures for accomplishing such labeling are well-knownin the art, for example, see (Stemberger et al., J. Histochem. Cytochem.18:315(1970); Bayer et at., Meth. Enzym. 62:308(1979); Engval et al.,Immunot. 109:129(1972); Goding, J. Immunol. Meth. 13:215(1976)). Thelabeled antibodies of the present invention can be used for in vitro, invivo, and in situ assays to identify cells or tissues which express aspecific peptide.

The above-described antibodies may also be immobilized on a solidsupport. Examples of such solid supports include plastics such aspolycarbonate, complex carbohydrates such as agarose and sepharose,acrylic resins and such as polyacrylamide and latex beads. Techniquesfor coupling antibodies to such solid supports are well known in the art(Weir et al., “Handbook of Experimental Immunology” 4th Ed., BlackwellScientific Publications, Oxford, England, Chapter 10(1986); Jacoby etal., Meth. Enzym. 34 Academic Press, N.Y. (1974)). The immobilizedantibodies of the present invention can be used for in vitro, in vivo,and in situ assays as well as in immunochromotography.

Furthermore, one skilled in the art can readily adapt currentlyavailable procedures, as well as the techniques, methods and kitsdisclosed above with regard to antibodies, to generate peptides capableof binding to a specific peptide sequence in order to generaterationally designed antipeptide peptides, for example see Hurby et al.,“Application of Synthetic Peptides: Antisense Peptides”, In SyntheticPeptides, A User's Guide, W. H. Freeman, N.Y., pp. 289-307(1992), andKaspczak et al., Biochemistry 28:9230-8(1989).

Anti-peptide peptides can be generated by replacing the basic amino acidresidues found in the MDK1 peptide sequence with acidic residues, whilemaintaining hydrophobic and uncharged polar groups. For example, lysine,arginine, and/or histidine residues are replaced with aspartic acid orglutamic acid and glutamic acid residues are replaced by lysine,arginine or histidine.

VII. An Antibody Based Method and Kit for Detecting MDK1

The present invention encompasses a method of detecting an MDK1polypeptide in a sample, comprising: a) contacting the sample with anabove-described antibody, under conditions such that immunocomplexesform, and b) detecting the presence of said antibody bound to thepolypeptide. In detail, the methods comprise incubating a test samplewith one or more of the antibodies of the present invention and assayingwhether the antibody binds to the test sample. Altered levels of MDK1 ina sample as compared to normal levels may indicate muscular disease.

Conditions for incubating an antibody with a test sample vary.Incubation conditions depend on the format employed in the assay, thedetection methods employed, and the type and nature of the antibody usedin the assay. One skilled in the art will recognize that any one of thecommonly available immunological assay formats (such asradioimmunoassays, enzyme-linked immunosorbent assays, diffusion basedOuchterlony, or rocket immunofluorescent assays) can readily be adaptedto employ the antibodies of the present invention. Examples of suchassays can be found in Chard, “An Introduction to Radioimmunoassay andRelated Techniques” Elsevier Science Publishers, Amsterdam, TheNetherlands (1986); Bullock et al., “Techniques in Immunocytochemistry,”Academic Press, Orlando, Fla. Vol. 1(1982), Vol. 2 (1983), Vol. 3(1985); Tijssen, “Practice and Theory of Enzyme Immunoassays: LaboratoryTechniques in Biochemistry and Molecular Biology,” Elsevier SciencePublishers, Amsterdam, The Netherlands (1985).

The immunological assay test samples of the present invention includecells, protein or membrane extracts of cells, or biological fluids suchas blood, serum, plasma, or urine. The test sample used in theabove-described method will vary based on the assay format, nature ofthe detection method and the tissues, cells or extracts used as thesample to be assayed. Methods for preparing protein extracts or membraneextracts of cells are well known in the art and can be readily beadapted in order to obtain a sample which is capable with the systemutilized.

A kit contains all the necessary reagents to carry out the previouslydescribed methods of detection. The kit may comprise: i) a firstcontainer means containing an above-described antibody, and ii) secondcontainer means containing a conjugate comprising a binding partner ofthe antibody and a label. In another preferred embodiment, the kitfurther comprises one or more other containers comprising one or more ofthe following: wash reagents and reagents capable of detecting thepresence of bound antibodies.

Examples of detection reagents include, but are not limited to, labeledsecondary antibodies, or in the alternative, if the primary antibody islabeled, the chromophoric, enzymatic, or antibody binding reagents whichare capable of reacting with the labeled antibody. The compartmentalizedkit may be as described above for nucleic acid probe kits. One skilledin the art will readily recognize that the antibodies described in thepresent invention can readily be incorporated into one of theestablished kit formats which are well known in the art.

VIII. Isolation of Compounds which Interact with MDK1

The present invention also relates to a method of detecting a compoundcapable of binding to a MDK1 polypeptide comprising incubating thecompound with MDK1 and detecting the presence of the compound bound toMDK1. The compound may be present within a complex mixture, for example,serum, body fluid, or cell extracts.

The present invention also relates to a method of detecting an agonistor antagonist of MDK1 activity comprising incubating cells that produceMDK1 in the presence of a compound and detecting changes in the level ofMDK1 activity. The compounds thus identified would produce a change inactivity indicative of the presence of the compound. The compound may bepresent within a complex mixture, for example, serum, body fluid, orcell extracts. Once the compound is identified it can be isolated usingtechniques well known in the art.

The present invention also encompasses a method of agonizing(stimulating) or antagonizing MDK1 associated activity in a mammalcomprising administering to said mammal an agonist or antagonist to MDK1in an amount sufficient to effect said agonism or antagonism. A methodof treating diabetes mellitus, skeletal muscle diseases, Alzheimer'sdisease, or peripheral neuropathies in a mammal with an agonist orantagonist of MDK1 activity comprising administering the agonist orantagonist to a mammal in an amount sufficient to agonize or antagonizeMDK1 associated functions is also encompassed in the presentapplication.

IX. Transgenic Animals

A variety of methods are available for the production of transgenicanimals associated with this invention. DNA can be injected into thepronucleus of a fertilized egg before fusion of the male and femalepronuclei, or injected into the nucleus of an embryonic cell (e.g., thenucleus of a two-cell embryo) following the initiation of cell division(Brinster et al., Proc. Nat. Acad. Sci. U.S.A. 82:4438-4442 (1985)).Embryos can be infected with viruses, especially retroviruses, modifiedto carry inorganic-ion receptor nucleotide sequences of the invention.

Pluripotent stem cells derived from the inner cell mass of the embryoand stabilized in culture can be manipulated in culture to incorporatenucleotide sequences of the invention. A transgenic animal can beproduced from such cells through implantation into a blastocyst that isimplanted into a foster mother and allowed to come to term. Animalssuitable for transgenic experiments can be obtained from standardcommercial sources such as Charles River (Wilmington, Mass.), Taconic(Germantown, N.Y.), Harlan Sprague Dawley (Indianapolis, Ind.), etc.

The procedures for manipulation of the rodent embryo and formicroinjection of DNA into the pronucleus of the zygote are well knownto those of ordinary skill in the art (Hogan et al., supra).Microinjection procedures for fish, amphibian eggs and birds aredetailed in Houdebine and Chourrout, Experientia 47:897-905 (1991).Other procedures for introduction of DNA into tissues of animals aredescribed in U.S. Pat. No., 4,945,050 (Sandford et al., Jul. 30, 1990).

By way of example only, to prepare a transgenic mouse, female mice areinduced to superovulate. Females are placed with males, and the matedfemales are sacrificed by CO₂ asphyxiation or cervical dislocation andembryos are recovered from excised oviducts. Surrounding cumulus cellsare removed. Pronuclear embryos are then washed and stored until thetime of injection. Randomly cycling adult female mice are paired withvasectomized males. Recipient females are mated at the same time asdonor females. Embryos then are transferred surgically. The procedurefor generating transgenic rats is similar to that of mice. See Hammer etal., Cell 63:1099-1112 (1990).

Methods for the culturing of embryonic stem (ES) cells and thesubsequent production of transgenic animals by the introduction of DNAinto ES cells using methods such as electroporation, calciumphosphate/DNA precipitation and direct injection also are well known tothose of ordinary skill in the art. See, for example, Teratocarcinomasand Embryonic Stem Cells. A Practical Approach, E. J. Robertson, ed.,IRL Press (1987).

In cases involving random gene integration, a clone containing thesequences(s) of the invention is co-transfected with a gene encodingresistance. Alternatively, the gene encoding neomycin resistance isphysically linked to the sequence(s) of the invention. Transfection andisolation of desired clones are carried out by any one of severalmethods well known to those of ordinary skill in the art (E. J.Robertson, supra).

DNA molecules introduced into ES cells can also be integrated into thechromosome through the process of homologous recombination. Capecchi,Science 244: 1288-1292 (1989). Methods for positive selection of therecombination event (i.e., neo resistance) and dual positive-negativeselection (i.e., neo resistance and gancyclovir resistance) and thesubsequent identification of the desired clones by PCR have beendescribed by Capecchi, supra and Joyner et al., Nature 338:153-156(1989), the teachings of which are incorporated herein. The final phaseof the procedure is to inject targeted ES cells into blastocysts and totransfer the blastocysts into pseudopregnant females. The resultingchimeric animals are bred and the offspring are analyzed by Southernblotting to identify individuals that carry the transgene. Proceduresfor the production of non-rodent mammals and other animals have beendiscussed by others. See Houdebine and Chourrout, supra; Pursel et al.,Science 244:1281-1288 (1989); and Simms et al., Bio/Technology 6:179-183(1988).

X. Compositions

The present invention relates to removing or reducing an abnormality ina signal transduction pathway, wherein the signal transduction pathwaycontains a MDK1 receptor tyrosine kinase and a MDK1 binding partner. Thepresent invention also relates to compositions and methods for thetreatment of disorders which involve modulating the activity and/orlevel of individual components, and relates to methods for theidentification of agents for such treatments. Additionally, the presentinvention relates to methods and compositions for prognostic evaluationof such disorders.

Described herein are compositions and methods for the prevention,prognostic evaluation, and treatment of neurodegenerative orneuroproliferative disorders, especially disorders such as Alzheimer'sdisease, Parkinson's disease, Lou Gehrig's disease (ALS), trauma,damaged or severed nerve injuries, Huntington's chorea, multiplesclerosis, muscular dystrophy, syringomiplia, Tabes Dorsalis,cardiovascular accidents, and other disorders described herein, in whicha MDK1 receptor tyrosine kinase may be involved. Also described arecompositions and methods for the prevention, prognostic evaluation andtreatment of cell proliferative disorders, especially cancer, in which aMDK1 receptor tyrosine kinase is involved.

First, methods and compositions for the treatment of such disorders aredescribed. Such methods and compositions may include, but are notlimited to the agents capable of decreasing or inhibiting theinteraction between a MDK1 receptor tyrosine kinase and a MDK1 bindingpartner and agents capable of inhibiting or decreasing the activity ofsuch complexes, agents capable of modulating the activity and/or levelof individual components of the proteins, and the use and administrationof such agents. Agents capable of modulating the activity and/or levelof interaction between MDK1 receptor tyrosine kinase and a MDK1 bindingpartner include those agents that inhibit or decrease thedephosphorylating activity of tyrosine phosphatases.

Second, methods are described for the identification of such agents.These methods may include, for example, assays to identify agentscapable of disrupting or inhibiting or promoting the interaction betweencomponents of the complexes (e.g., MDK1:binding partner complexes), andmay also include paradigms and strategies for the rational design ofdrugs capable of disruption and/or inhibition and/or promotion of suchcomplexes.

XI. Binding Partner/Receptor Complexes

The complexes involved in the invention include a MDK1 receptor tyrosinekinase and a MDK1 binding partner or derivatives thereof, as describedbelow. Under standard physiological conditions, the components of suchcomplexes are capable of forming stable, non-covalent attachments withone or more of the other complex components. Methods for thepurification and production of such protein complexes, and of cells thatexhibit such complexes are described below.

The complexes involved in the invention also include tyrosinephosphatases responsible for dephosphorylating activated MDK1 receptors,thus modulating the ability to bind to a binding partner and othersignal transduction components. Identification of such tyrosinephosphatase(s) may be accomplished using techniques known to one skilledin the art.

XII. Disruption of Protein Complexes

Disruption of complexes (e.g., MDK1:binding partner complexes), forexample by decreasing or inhibiting or promoting the interactionsbetween component members of such a complex may have differingmodulatory effects on the event involved, depending on the individualprotein complex. “Disruption”, as used here, is meant to refer not onlyto a physical separation of protein complex components, but also refersto a perturbation of the activity of the complexes, regardless ofwhether or not such complexes remain able, physically, to form.“Activity”, as used here, refers to the function of the protein complexin the signal transduction cascade of the cell in which such a complexis formed, i.e., refers to the function of the complex in effecting orinhibiting a transduction of an extracellular signal into a cell. Forexample, the effect of complex disruption may augment, reduce, or blocka signal normally transduced into the cell. Likewise, depending on thedisorder involved, either augmentation, reduction, or blockage of asignal normally transduced into the cell will be desirable for thetreatment of the disorder.

A disorder involving a complex may, for example, develop because thepresence of such a complex brings about the aberrant inhibition of anormal signal transduction event. In such a case, the disruption of thecomplex would allow the restoration of the usual signal transductionevent. Further, an aberrant complex may bring about an alteredsubcellular adapter protein localization, which may result in, forexample, dysfunctional cellular events. An inhibition of the complex inthis case would allow for restoration or maintenance of a normalcellular architecture. Still further, an agent or agents that cause(s)disruption of the complex may bring about the disruption of theinteractions among other potential components of a complex.

Nucleotide sequences encoding peptide agents which are to be utilizedintracellularly may be expressed in the cells of interest, usingtechniques which are well known to those of ordinary skill in the art.For example, expression vectors derived from viruses such asretroviruses, vaccinia virus, adenoviruses, adeno-associated virus,herpes viruses, or bovine papilloma virus, may be used for delivery andexpression of such nucleotide sequences into the targeted cellpopulation. Methods for the construction of such vectors are well known.See, for example, the techniques described in Maniatis et al., 1989,Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory,N.Y. and in Ausubel et al., Current Protocols in Molecular Biology,Greene Publishing Associates and Wiley Interscience, N.Y, 1989.Complex-binding domains can be identified using, for example, techniquessuch as those described in Rotin et al. (Rotin et al., EMBO J.11:559-567, 1992), Songyang et al. (Songyang et al., Cell 72:767-778,1993), Felder et al., Mol. Cell. Biol. 13:1449-1455, 1993), Fantl et al.(Cell 69:413-422, 1992), and Domchek et al. (Biochemistry 31:9865-9870,1992).

Alternatively, antibodies capable of interfering with complex formationmay be produced as described below and administered for the treatment ofdisorders involving a component capable of forming a complex withanother protein. For example, neutralizing antibodies which are capableof interfering with ligand binding may be administered using standardtechniques. Alternatively, nucleotide sequences encoding single-chainantibodies may be expressed within the target cell population byutilizing, for example, techniques such as those described in Marasco etal. (Marasco et al., Proc. Natl. Acad. Sci. USA 90:7889-7893, 1993).

Agents which act intracellularly to interfere with the formation and/oractivity of the protein complexes of the invention may also be smallorganic or inorganic compounds. A method for identifying these and otherintracellular agents is described below.

XIII. Antibodies to Complexes

Described herein are methods for the production of antibodies which arecapable of specifically recognizing a complex or an epitope thereof, orof specifically recognizing an epitope on either of the components ofthe complex, especially those epitopes which would not be recognized bythe antibody when the component is present separate and apart from thecomplex. Such antibodies may include, but are not limited to polyclonalantibodies, monoclonal antibodies (mAbs), humanized or chimericantibodies, single chain antibodies, Fab fragments, F(ab′)₂ fragments,fragments produced by a FAb expression library, anti-idiotypic (anti-Id)antibodies, and epitope-binding fragments of any of the above. Suchantibodies may be used, for example, in the detection of a complex in abiological sample, or, alternatively, as a method for the inhibition ofa complex formation, thus inhibiting the development of a disorder.

Polyclonal antibodies are heterogeneous populations of antibodymolecules derived from the sera of animals immunized with an antigen,such as a complex, or an antigenic functional derivative thereof. Forthe production of polyclonal antibodies, various host animals may beimmunized by injection with the complex including but not limited torabbits, mice, rats, etc. Various adjuvants may be used to increase theimmunological response, depending on the host species, including but notlimited to Freund's (complete and incomplete), mineral gels such aslysolecithin, pluronic polyols, polyanions, peptides, oil emulsions,keyhole limpet hemocyanin, dinitrophenol, and potentially useful humanadjuvants such as BCG (bacille Calmette-Guerin) and Corynebacteriumparvum.

A monoclonal antibody, which is a substantially homogeneous populationof antibodies to a particular antigen, may be obtained by any techniquewhich provides for the production of antibody molecules by continuouscell lines in culture. These include, but are not limited to thehybridoma technique of Kohler and Milstein (Nature 256:495-497, 1975)and U.S. Pat. No. 4,376,110), the human B-cell hybridoma technique(Kosbor et al., Immunology Today 4:72, 1983; Cole et al., Proc. Natl.Acad. Sci. USA 80:2026-2030, 1983), and the EBV-hybridoma technique(Cole et al., Monoclonal Antibodies And Cancer Therapy, Alan R. Liss,Inc., 1985, pp. 77-96). Such antibodies may be of any immunoglobulinclass including IgG, IgM, IgE, IgA, IgD and any subclass thereof. Thehybridoma producing the mAb of this invention may be cultivated in vitroor in vivo. Production of high titers of mAbs in vivo makes this thepresently preferred method of production.

In addition, techniques developed for the production of “chimericantibodies” (Morrison et al., Proc. Natl. Acad. Sci., 81:6851-6855,1984; Neuberger et al., Nature, 312:604-608, 1984; Takeda et al.,Nature, 314:452-454, 1985) by splicing the genes from a mouse antibodymolecule of appropriate antigen specificity together with genes from ahuman antibody molecule of appropriate biological activity can be used.A chimeric antibody is a molecule in which different portions arederived from different animal species, such as those having a variableregion derived from a murine mAb and a human immunoglobulin constantregion.

Alternatively, techniques described for the production of single chainantibodies (U.S. Pat. No. 4,946,778; Bird, Science 242:423-426, 1988;Huston et al., Proc. Natl. Acad. Sci. USA 85:5879-5883, 1988; and Wardet al., Nature 334:544-546, 1989) can be adapted to producecomplex-specific single chain antibodies. Single chain antibodies areformed by linking the heavy and light chain fragment of the Fv regionvia an amino acid bridge, resulting in a single chain polypeptide.

Antibody fragments which contain specific binding sites of a complex maybe generated by known techniques. For example, such fragments includebut are not limited to: the F(ab′)₂ fragments which can be produced bypepsin digestion of the antibody molecule and the Fab fragments whichcan be generated by reducing the disulfide bridges of the F(ab′)₂fragments. Alternatively, Fab expression libraries may be constructed(Huse et al., 1989, Science, 246:1275-1281) to allow rapid and easyidentification of monoclonal Fab fragments with the desired specificityto the PTK/adapter complex.

One or more components of a protein complex may be present at a higherthan normal cellular level (i.e., higher than the concentration known tousually be present in the cell type exhibiting the protein complex ofinterest) and/or may exhibit an abnormally increased level of cellularactivity (i.e., greater than the activity known to usually be present inthe cell type exhibiting the protein complex of interest).

For example, the gene encoding a protein complex component may begin tobe overexpressed, or may be amplified (i.e., its gene copy number may beincreased) in certain cells, leading to an increased number of componentmolecules within these cells. Additionally, a gene encoding a proteincomplex component may begin to express a modified protein product thatexhibits a greater than normal level of activity. “Activity”, here,refers to the normal cellular function of the component, eitherenzymatic or structural whose function may include, for example,bringing two or more cellular molecules into the appropriate proximity.

Such an increase in the cellular level and/or activity of a proteincomplex may lead to the development of a disorder. Treatment of suchdisorders may, therefore, be effectuated by the administration of agentswhich decrease the cellular level and/or the activity of theoverexpressed and/or overactive protein complex component.

Techniques for decreasing the cellular level and/or the activity of oneor more of the protein complex components of interest may include, butare not limited to antisense or ribozyme approaches, and/or gene therapyapproaches, each of which is well known to those of skill in the art.

XIV. Antisense and Ribozyme Approaches to Provide or Disrupt theComplexes of the Present Invention

Included in the scope of the invention are oligoribonucleotides,including antisense RNA and DNA molecules and ribozymes that function toinhibit translation of one or more components of a protein complex.Anti-sense RNA and DNA molecules act to directly block the translationof mRNA by binding to targeted mRNA and preventing protein translation.With respect to antisense DNA, oligodeoxyribonucleotides derived fromthe translation initiation site, e.g., between −10 and +10 regions ofthe relevant nucleotide sequence, are preferred.

Ribozymes are enzymatic RNA molecules capable of catalyzing the specificcleavage of RNA. The mechanism of ribozyme action involves sequencespecific interaction of the ribozyme molecule to complementary targetRNA, followed by a endonucleolytic cleavage. Within the scope of theinvention are engineered hammerhead or other motif ribozyme moleculesthat specifically and efficiently catalyze endonucleolytic cleavage ofRNA sequences encoding protein complex components.

Specific ribozyme cleavage sites within any potential RNA target areinitially identified by scanning the target molecule for ribozymecleavage sites which include the following sequences, GUA, GUU and GUC.Once identified, short RNA sequences of between 15 and 20ribonucleotides corresponding to the region of the target genecontaining the cleavage site may be evaluated for predicted structuralfeatures, such as secondary structure, that may render theoligonucleotide sequence unsuitable. The suitability of candidatetargets may also be evaluated by testing their accessibility tohybridization with complementary oligonucleotides, using ribonucleaseprotection assays. See, Draper PCT WO 93/23569.

Both anti-sense RNA and DNA molecules and ribozymes of the invention maybe prepared by any method known in the art for the synthesis of RNAmolecules. See, Draper, id. hereby incorporated by reference herein.These include techniques for chemically synthesizingoligodeoxyribonucleotides well known in the art such as for examplesolid phase phosphoramidite chemical synthesis. Alternatively, RNAmolecules may be generated by in vitro and in vivo transcription of DNAsequences encoding the antisense RNA molecule. Such DNA sequences may beincorporated into a wide variety of vectors which incorporate suitableRNA polymerase promoters such as the T7 or SP6 polymerase promoters.Alternatively, antisense cDNA constructs that synthesize antisense RNAconstitutively or inducibly, depending on the promoter used, can beintroduced stably into cell lines.

Various modifications to the DNA molecules may be introduced as a meansof increasing intracellular stability and half-life. Possiblemodifications include but are not limited to the addition of flankingsequences of ribo- or deoxy-nucleotides to the 5′ and/or 3′ ends of themolecule or the use of phosphorothioate or 2′ O-methyl rather thanphosphodiesterase linkages within the oligodeoxyribonucleotide backbone.

Gene Therapy

MDK1 or its genetic sequences will also be useful in gene therapy(reviewed in Miller, Nature 357:455-460, (1992). Miller states thatadvances have resulted in practical approaches to human gene therapythat have demonstrated positive initial results. The basic science ofgene therapy is described in Mulligan, Science 260:926-931, (1993).

In one preferred embodiment, an expression vector containing the MDK1coding sequence is inserted into cells, the cells are grown in vitro andthen infused in large numbers into patients. In another preferredembodiment, a DNA segment containing a promoter of choice (for example astrong promoter) is transferred into cells containing an endogenous MDK1in such a manner that the promoter segment enhances expression of theendogenous MDK1 gene (for example, the promoter segment is transferredto the cell such that it becomes directly linked to the endogenous MDK1gene).

The gene therapy may involve the use of an adenovirus containing MDK1cDNA targeted to a tumor, systemic MDK1 increase by implantation ofengineered cells, injection with MDK1 virus, or injection of naked MDK1DNA into appropriate tissues.

Target cell populations may be modified by introducing altered forms ofone or more components of the protein complexes in order to modulate theactivity of such complexes. For example, by reducing or inhibiting acomplex component activity within target cells, an abnormal signaltransduction event(s) leading to a condition may be decreased,inhibited, or reversed. Deletion or missense mutants of a component,that retain the ability to interact with other components of the proteincomplexes but cannot function in signal transduction may be used toinhibit an abnormal, deleterious signal transduction event.

Expression vectors derived from viruses such as retroviruses, vacciniavirus, adenovirus, adeno-associated virus, herpes viruses, several RNAviruses, or bovine papilloma virus, may be used for delivery ofnucleotide sequences (e.g., cDNA) encoding recombinant MDK1 protein intothe targeted cell population (e.g., tumor cells). Methods which are wellknown to those skilled in the art can be used to construct recombinantviral vectors containing coding sequences. See, for example, thetechniques described in Maniatis et al., Molecular Cloning: A LaboratoryManual, Cold Spring Harbor Laboratory, N.Y. (1989), and in Ausubel etal., Current Protocols in Molecular Biology, Greene PublishingAssociates and Wiley Interscience, N.Y. (1989). Alternatively,recombinant nucleic acid molecules encoding protein sequences can beused as naked DNA or in reconstituted system e.g., liposomes or otherlipid systems for delivery to target cells (See e.g., Felgner et al.,Nature 337:387-8, 1989). Several other methods for the direct transferof plasmid DNA into cells exist for use in human gene therapy andinvolve targeting the DNA to receptors on cells by complexing theplasmid DNA to proteins. See, Miller, supra.

In its simplest form, gene transfer can be performed by simply injectingminute amounts of DNA into the nucleus of a cell, through a process ofmicroinjection. Capecchi M R, Cell 22:479-88 (1980). Once recombinantgenes are introduced into a cell, they can be recognized by the cellsnormal mechanisms for transcription and translation, and a gene productwill be expressed. Other methods have also been attempted forintroducing DNA into larger numbers of cells. These methods include:transfection, wherein DNA is precipitated with CaPO₄ and taken intocells by pinocytosis (Chen C. and Okayama H, Mol. Cell Biol. 7:2745-52(1987)); electroporation, wherein cells are exposed to large voltagepulses to introduce holes into the membrane (Chu G. et al., NucleicAcids Res., 15:1311-26 (1987)); lipofection/liposome fusion, wherein DNAis packaged into lipophilic vesicles which fuse with a target cell(Felgner P L., et al., Proc. Natl. Acad. Sci. USA. 84:7413-7 (1987));and particle bombardment using DNA bound to small projectiles (Yang N S.et al., Proc. Natl. Acad. Sci. 87:9568-72 (1990)). Another method forintroducing DNA into cells is to couple the DNA to chemically modifiedproteins.

It has also been shown that adenovirus proteins are capable ofdestabilizing endosomes and enhancing the uptake of DNA into cells. Theadmixture of adenovirus to solutions containing DNA complexes, or thebinding of DNA to polylysine covalently attached to adenovirus usingprotein crosslinking agents substantially improves the uptake andexpression of the recombinant gene. Curiel DT et al., Am. J. Respir.Cell. Mol. Biol., 6:247-52 (1992).

As used herein “gene transfer” means the process of introducing aforeign nucleic acid molecule into a cell. Gene transfer is commonlyperformed to enable the expression of a particular product encoded bythe gene. The product may include a protein, polypeptide, anti-sense DNAor RNA, or enzymatically active RNA. Gene transfer can be performed incultured cells or by direct administration into animals. Generally genetransfer involves the process of nucleic acid contact with a target cellby non-specific or receptor mediated interactions, uptake of nucleicacid into the cell through the membrane or by endocytosis, and releaseof nucleic acid into the cytoplasm from the plasma membrane or endosome.Expression may require, in addition, movement of the nucleic acid intothe nucleus of the cell and binding to appropriate nuclear factors fortranscription.

As used herein “gene therapy” is a form of gene transfer and is includedwithin the definition of gene transfer as used herein and specificallyrefers to gene transfer to express a therapeutic product from a cell invivo or in vitro. Gene transfer can be performed ex vivo on cells whichare then transplanted into a patient, or can be performed by directadministration of the nucleic acid or nucleic acid-protein complex intothe patient.

In another preferred embodiment, a vector having nucleic acid sequencesencoding MDK1 is provided in which the nucleic acid sequence isexpressed only in specific tissue. Methods of achieving tissue-specificgene expression as set forth in International Publication No. WO93/09236, filed Nov. 3, 1992 and published May 13, 1993.

In all of the preceding vectors set forth above, a further aspect of theinvention is that the nucleic acid sequence contained in the vector mayinclude additions, deletions or modifications to some or all of thesequence of the nucleic acid, as defined above.

In another preferred embodiment, a method of gene replacement is setforth. “Gene replacement” as used herein means supplying a nucleic acidsequence which is capable of being expressed in vivo in an animal andthereby providing or augmenting the function of an endogenous gene whichis missing or defective in the animal.

XVI. Pharmaceutical Formulations and Modes of Administration

The particular compound, antibody, antisense or ribozyme molecule thataffects the protein complexes and the disorder of interest can beadministered to a patient either by themselves, or in pharmaceuticalcompositions where it is mixed with suitable carriers or excipient(s).

In treating a patient exhibiting an oncogenic disorder of interest, atherapeutically effective amount of a agent or agents such as these isadministered. A therapeutically effective dose refers to that amount ofthe compound that results in amelioration of symptoms or a prolongationof survival in a patient.

Toxicity and therapeutic efficacy of such compounds can be determined bystandard pharmaceutical procedures in cell cultures or experimentalanimals, e.g., for determining the LD₅₀ (the dose lethal to 50% of thepopulation) and the ED₅₀ (the dose therapeutically effective in 50% ofthe population). The dose ratio between toxic and therapeutic effects isthe therapeutic index and it can be expressed as the ratio LD₅₀/ED₅₀.Compounds which exhibit large therapeutic indices are preferred. Thedata obtained from these cell culture assays and animal studies can beused in formulating a range of dosage for use in human. The dosage ofsuch compounds lies preferably within a range of circulatingconcentrations that include the ED₅₀ with little or no toxicity. Thedosage may vary within this range depending upon the dosage formemployed and the route of administration utilized.

For any compound used in the method of the invention, thetherapeutically effective dose can be estimated initially from cellculture assays. For example, a dose can be formulated in animal modelsto achieve a circulating plasma concentration range that includes theIC₅₀ as determined in cell culture (i.e., the concentration of the testcompound which achieves a half-maximal disruption of the proteincomplex, or a half-maximal inhibition of the cellular level and/oractivity of a complex component). Such information can be used to moreaccurately determine useful doses in humans. Levels in plasma may bemeasured, for example, by HPLC.

The exact formulation, route of administration and dosage can be chosenby the individual physician in view of the patient's condition. (Seee.g. Fingl et al., in The Pharmacological Basis of Therapeutics, 1975,Ch. 1 p. 1).

It should be noted that the attending physician would know how to andwhen to terminate, interrupt, or adjust administration due to toxicity,or to organ dysfunctions. Conversely, the attending physician would alsoknow to adjust treatment to higher levels if the clinical response werenot adequate (precluding toxicity). The magnitude of an administrateddose in the management of the oncogenic disorder of interest will varywith the severity of the condition to be treated and to the route ofadministration. The severity of the condition may, for example, beevaluated, in part, by standard prognostic evaluation methods. Further,the dose and perhaps dose frequency, will also vary according to theage, body weight, and response of the individual patient. A programcomparable to that discussed above may be used in veterinary medicine.

Depending on the specific conditions being treated, such agents may beformulated and administered systemically or locally. Techniques forformulation and administration may be found in Remington'sPharmaceutical Sciences, 18th ed., Mack Publishing Co., Easton, Pa.(1990). Suitable routes may include oral, rectal, transdermal, vaginal,transmucosal, or intestinal administration; parenteral delivery,including intramuscular, subcutaneous, intramedullary injections, aswell as intrathecal, direct intraventricular, intravenous,intraperitoneal, intranasal, or intraocular injections, just to name afew.

For injection, the agents of the invention may be formulated in aqueoussolutions, preferably in physiologically compatible buffers such asHanks's solution, Ringer's solution, or physiological saline buffer. Forsuch transmucosal administration, penetrants appropriate to the barrierto be permeated are used in the formulation. Such penetrants aregenerally known in the art.

Use of pharmaceutically acceptable carriers to formulate the compoundsherein disclosed for the practice of the invention into dosages suitablefor systemic administration is within the scope of the invention. Withproper choice of carrier and suitable manufacturing practice, thecompositions of the present invention, in particular, those formulatedas solutions, may be administered parenterally, such as by intravenousinjection. The compounds can be formulated readily usingpharmaceutically acceptable carriers well known in the art into dosagessuitable for oral administration. Such carriers enable the compounds ofthe invention to be formulated as tablets, pills, capsules, liquids,gels, syrups, slurries, suspensions and the like, for oral ingestion bya patient to be treated.

Agents intended to be administered intracellularly may be administeredusing techniques well known to those of ordinary skill in the art. Forexample, such agents may be encapsulated into liposomes, thenadministered as described above. Liposomes are spherical lipid bilayerswith aqueous interiors. All molecules present in an aqueous solution atthe time of liposome formation are incorporated into the aqueousinterior. The liposomal contents are both protected from the externalmicroenvironment and, because liposomes fuse with cell membranes, areefficiently delivered into the cell cytoplasm. Additionally, due totheir hydrophobicity, small organic molecules may be directlyadministered intracellularly.

Pharmaceutical compositions suitable for use in the present inventioninclude compositions wherein the active ingredients are contained in aneffective amount to achieve its intended purpose. Determination of theeffective amounts is well within the capability of those skilled in theart, especially in light of the detailed disclosure provided herein.

In addition to the active ingredients, these pharmaceutical compositionsmay contain suitable pharmaceutically acceptable carriers comprisingexcipients and auxiliaries which facilitate processing of the activecompounds into preparations which can be used pharmaceutically. Thepreparations formulated for oral administration may be in the form oftablets, dragees, capsules, or solutions.

The pharmaceutical compositions of the present invention may bemanufactured in a manner that is itself known, e.g., by means ofconventional mixing, dissolving, granulating, dragee-making, levigating,emulsifying, encapsulating, entrapping or lyophilizing processes.

Pharmaceutical formulations for parenteral administration includeaqueous solutions of the active compounds in water-soluble form.Additionally, suspensions of the active compounds may be prepared asappropriate oily injection suspensions. Suitable lipophilic solvents orvehicles include fatty oils such as sesame oil, or synthetic fatty acidesters, such as ethyl oleate or triglycerides, or liposomes. Aqueousinjection suspensions may contain substances which increase theviscosity of the suspension, such as sodium carboxymethyl cellulose,sorbitol, or dextran. Optionally, the suspension may also containsuitable stabilizers or agents which increase the solubility of thecompounds to allow for the preparation of highly concentrated solutions.

Pharmaceutical preparations for oral use can be obtained by combiningthe active compounds with solid excipient, optionally grinding aresulting mixture, and processing the mixture of granules, after addingsuitable auxiliaries, if desired, to obtain tablets or dragee cores.Suitable excipients are, in particular, fillers such as sugars,including lactose, sucrose, mannitol, or sorbitol; cellulosepreparations such as, for example, maize starch, wheat starch, ricestarch, potato starch, gelatin, gum tragacanth, methyl cellulose,hydroxypropylmethyl-cellulose, sodium carboxy-methylcellulose, and/orpolyvinylpyrrolidone (PVP). If desired, disintegrating agents may beadded, such as the cross-linked polyvinyl pyrrolidone, agar, or alginicacid or a salt thereof such as sodium alginate.

Dragee cores are provided with suitable coatings. For this purpose,concentrated sugar solutions may be used, which may optionally containgum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethyleneglycol, and/or titanium dioxide, lacquer solutions, and suitable organicsolvents or solvent mixtures. Dyestuffs or pigments may be added to thetablets or dragee coatings for identification or to characterizedifferent combinations of active compound doses.

Pharmaceutical preparations which can be used orally include push-fitcapsules made of gelatin, as well as soft, sealed capsules made ofgelatin and a plasticizer, such as glycerol or sorbitol. The push-fitcapsules can contain the active ingredients in admixture with fillersuch as lactose, binders such as starches, and/or lubricants such astalc or magnesium stearate and, optionally, stabilizers. In softcapsules, the active compounds may be dissolved or suspended in suitableliquids, such as fatty oils, liquid paraffin, or liquid polyethyleneglycols. In addition, stabilizers may be added.

The nucleic acid sequence encoding MDK1 can be administeredprophylactically, or to patients having a disorder listed above, e.g.,by exogenous delivery of the nucleic acid sequence encoding MDK1 asnaked DNA, DNA associated with specific carriers, or in a nucleic acidexpression vector to a desired tissue by means of an appropriatedelivery vehicle, e.g., a liposome, by use of iontophoresis,electroporation and other pharmacologically approved methods ofdelivery. Routes of administration may include intramuscular,intravenous, aerosol, oral (tablet or pill form), topical, systemic,ocular, as a suppository, intraperitoneal and/or intrathecal.

Some methods of delivery that may be used include:

a. encapsulation in liposomes,

b. transduction by retroviral vectors,

c. localization to nuclear compartment utilizing nuclear targeting sitefound on most nuclear proteins,

d. transfection of cells ex vivo with subsequent reimplantation oradministration of the transfected cells,

e. a DNA transporter system.

A MDK1 nucleic acid sequence may be administered utilizing an ex vivoapproach whereby cells are removed from an animal, transduced with theMDK1 nucleic acid sequence and reimplanted into the animal. The livercan be accessed by an ex vivo approach by removing hepatocytes from ananimal, transducing the hepatocytes in vitro with the MDK1 nucleic acidsequence and reimplanting them into the animal (e.g., as described forrabbits by Chowdhury et al, Science 254: 1802-1805, 1991, or in humansby Wilson, Hum. Gene Ther. 3: 179-222, 1992) incorporated herein byreference.

Many nonviral techniques for the delivery of a MDK1 nucleic acidsequence into a cell can be used, including direct naked DNA uptake(e.g., Wolff et al., Science 247: 1465-1468, 1990), receptor-mediatedDNA uptake, e.g., using DNA coupled to asialoorosomucoid which is takenup by the asialoglycoprotein receptor in the liver (Wu and Wu, J. Biol.Chem. 262: 4429-4432, 1987; Wu et al., J. Biol. Chem. 266: 14338-14342,1991), and liposome-mediated delivery (e.g., Kaneda et al., Expt. CellRes. 173: 56-69, 1987; Kaneda et al., Science 243: 375-378, 1989; Zhu etal., Science 261: 209-211, 1993). Many of these physical methods can becombined with one another and with viral techniques; enhancement ofreceptor-mediated DNA uptake can be effected, for example, by combiningits use with adenovirus (Curiel et al., Proc. Natl. Acad. Sci. USA 88:8850-8854, 1991; Cristiano et al., Proc. Natl. Acad. Sci. USA 90:2122-2126, 1993).

The MDK1 or nucleic acid encoding MDK1 may also be administered via animplanted device that provides a support for growing cells. Thus, thecells may remain in the implanted device and still provide the usefuland therapeutic agents of the present invention.

XVII. Identification of Agents

The complexes, components of such complexes, functional equivalentsthereof, and/or cell lines that express such components and exhibit suchprotein complexes may be used to screen for additional compounds,antibodies, or other molecules capable of modulating the signaltransduction event such complexes are involved in. Methods for purifyingand/or producing such complexes, components of the complexes, functionalequivalents thereof, and/or cell lines are described herein. Thecompounds, antibodies, or other molecules identified may, for example,act to disrupt the protein complexes of the invention (i.e., decrease orinhibit interactions between component members of the complexes, therebycausing physical separation of the components, and/or perturbing theactivity of the complexes) or may lower the cellular level and/ordecrease the activity of one or more of the components of suchcomplexes.

Such compounds may include, but are not limited to, peptides made of D-and/or L-configuration amino acids (in, for example, the form of randompeptide libraries; see Lam et al., Nature 354:82-84, 1991),phosphopeptides (in, for example, the form of random or partiallydegenerate, directed phosphopeptide libraries, see Songyang et al., Cell767-778, 1993), antibodies, and small organic or inorganic molecules.Synthetic compounds, natural products, and other sources of potentiallybiologically active materials may be screened in a variety of ways, asdescribed herein. The compounds, antibodies, or other moleculesidentified may be used as oncogenic disorder treatments, as describedherein.

Compounds that bind to individual components, or functional portions ofthe individual components of the complexes (and may additionally becapable of disrupting complex formation) may be identified.

One such method included within the scope of the invention is a methodfor identifying an agent to be tested for an ability to modulate asignal transduction pathway disorder. The method involves exposing atleast one agent to a protein comprising a functional portion of a memberof the protein complex for a time sufficient to allow binding of theagent to the functional portion of the member; removing non-boundagents; and determining the presence of the compound bound to thefunctional portion of the member of the protein complex, therebyidentifying an agent to be tested for an ability to modulate a disorderinvolving a polypeptide complex.

By “signal transduction disorder” is meant any disease or conditionassociated with an abnormality in a signal transduction pathway. Theprotein complex referred to below is a physical association of a MDK1receptor tyrosine kinase and a MDK1 binding partner. The level ofinteraction between the two components of the complex may be abnormaland thus cause the abnormality in the signal transduction pathway.Alternatively, the level of interaction between the complex componentsmay be normal, but affecting that interaction may effectively treat asignal transduction pathway disorder.

The term “protein” refers to a compound formed of 5-50 or more aminoacids joined together by peptide bonds. An “amino acid” is a subunitthat is polymerized to form proteins and there are twenty amino acidsthat are universally found in proteins. The general formula for an aminoacid is H₂N—CHR—COOH, in which the R group can be anything from ahydrogen atom (as in the amino acid glycine) to a complex ring (as inthe amino acid tryptophan).

A functional portion of an individual component of the complexes may bedefined here as a protein portion of an individual component of acomplex still capable of forming a stable complex with another member ofthe complex under standard cellular and physiological conditions. Forexample, a functional portion of a component may include, but is notlimited to, a protein portion of MDK1 which is still capable of stablybinding a corresponding binding partner domain of an associated protein,and thus is still capable of forming a complex with that protein.Further, in the case of the catalytic domains of the individualcomponents of the invention, a functional portion of a catalytic domainmay refer to a protein still capable of stably binding a substratemolecule under standard physiological conditions.

One method utilizing this approach that may be pursued in the isolationof such complex component-binding molecules would include the attachmentof a component molecule, or a functional portion thereof, to a solidmatrix, such as agarose or plastic beads, microtiter wells, petridishes, or membranes composed of, for example, nylon or nitrocellulose,and the subsequent incubation of the attached component molecule in thepresence of a potential component-binding compound or compounds.Attachment to said solid support may be direct or by means of acomponent specific antibody bound directly to the solid support. Afterincubation, unbound compounds are washed away, component-bound compoundsare recovered. By utilizing this procedure, large numbers of types ofmolecules may be simultaneously screened for complex component-bindingactivity.

The complex components which may be utilized in the above screeningmethod may include, but are not limited to, molecules or functionalportions thereof, such as catalytic domains, phosphorylation domains,extracellular domains, or portions of extracellular domains, such asligand-binding domains, and adaptor proteins, or functional portionsthereof. The peptides used may be phosphorylated, e.g., may contain atleast one phosphorylated amino acid residue, preferably a phosphorylatedTyr amino acid residue, or may be unphosphorylated. A phosphorylationdomain may be defined as a peptide region that is specificallyphosphorylated at certain amino acid residues. A functional portion ofsuch a phosphorylation domain may be defined as a peptide capable ofbeing specifically phosphorylated at certain amino acids by a specificprotein.

Molecules exhibiting binding activity may be further screened for anability to disrupt protein complexes. Alternatively, molecules may bedirectly screened for an ability to promote the complexes. For example,in vitro complex formation may be assayed by, first, immobilizing onecomponent, or a functional portion thereof, of the complex of interestto a solid support. Second, the immobilized complex component may beexposed to a compound such as one identified as above, and to the secondcomponent, or a functional portion thereof, of the complex of interest.Third, it may be determined whether or not the second component is stillcapable of forming a complex with the immobilized component in thepresence of the compound. In addition, one could look for an increase inbinding.

Additionally, complex formation in a whole cell may be assayed byutilizing co-immunoprecipitation techniques well known to those of skillin the art. Briefly, a cell line capable of forming a complex ofinterest may be exposed to a compound such as one identified as above,and a cell lysate may be prepared from this exposed cell line. Anantibody raised against one of the components of the complex of interestmay be added to the cell lysate, and subjected to standardimmunoprecipitation techniques. In cases where a complex is stillformed, the immunoprecipitation will precipitate the complex, whereas incases where the complex has been disrupted, only the complex componentto which the antibody is raised will be precipitated.

A preferred method for assessing modulation of complex formation withina cell utilizes a method similar to that described above. Briefly, acell line capable of forming a complex of interest is exposed to a testcompound. The cells are lysed and the lysate contacted with an antibodyspecific to one component of the complex, said antibody having beenpreviously bound to a solid support. Unbound material is washed away,and the bound material is exposed to a second antibody, said secondantibody binding specifically to a second component of the complex. Theamount of second antibody bound is easily detected by techniques wellknown in the art. Cells exposed to an inhibitory test compound will haveformed a lesser amount of complex compared to cells not exposed to thetest compound, as measured by the amount of second antibody bound. Cellsexposed to a test compound that promotes complex formation will have anincreased amount of second antibody bound.

The effect of an agent on the differentiation capability of the complexof interest may be directly assayed. Such agents may, but are notrequired to, include those agents identified by utilizing the abovescreening technique. For example, an agent or agents may be administeredto a cell such as a neuronal cell, capable of forming a complex, forexample, which, in the absence of any agent, would not lead to thecell's differentiation. The differentiation state of the cell may thenbe measured either in vitro or in vivo. One method of measurement mayinvolve observing the amount of neurile growth present.

Agents capable of disrupting complex formation and capable of reducingor inhibiting disorders, which involve the formation of such complexes,or which involve the lack of formation of such complexes, may be used inthe treatment of patients exhibiting or at risk for such disorders. Asufficient amount of agent or agents such as those described above maybe administered to a patient so that the symptoms of the disease orcondition are reduced or eliminated.

XVIII. Purification and Production of Complexes

Described in this Section are methods for the synthesis or recombinantexpression of components, or fragments thereof, of the protein complexesof the invention. Also described herein are methods by which cellsexhibiting the protein complexes of the invention may be engineered.

XIX. Purification Methods

The complexes of the invention may be substantially purified, i.e., maybe purified away from at least 90% (on a weight basis), and from atleast 99%, if desired, of other proteins, glycoproteins, and othermacromolecules with which it in associated. Such purification can beachieved by utilizing a variety of procedures well known to those ofskill in the art, such as subjecting cells, tissue or fluid containingthe complex to a combination of standard methods, for example, ammoniumsulfate precipitation, molecular sieve chromatography, and/or ionexchange chromatography.

Alternatively, or additionally, a complex may be purified byimmunoaffinity chromatography using an immunoadsorbent column to whichan antibody is immobilized which is capable of binding to one or morecomponents of the complex. Such an antibody may be monoclonal orpolyclonal in origin. Other useful types of affinity purification forthe protein complex may utilize, for example, a solid-phase substratewhich binds the catalytic kinase domain of a protein, or an immobilizedbinding site for noncatalytic domains of the components of the complex,which bind in such a manner as to not disrupt the complex. The complexof the present invention may be biochemically purified from a variety ofcell or tissue sources.

XX. Synthesis and Expression Methods

Methods for the synthesis of polypeptides or fragments thereof, whichare capable of acting as components of the complexes of the presentinvention, are well-known to those of ordinary skill in the art. See,for example, Creighton, Proteins: Structures and Molecular Principles,W. H. Freeman and Co., NY (1983), which is incorporated herein, byreference, in its entirety.

Components of a complex which have been separately synthesized orrecombinantly produced, may be reconstituted to form a complex bystandard biochemical techniques well known to those skilled in the art.For example, samples containing the components of the complex may becombined in a solution buffered with greater than about 150 mM NaCl, ata physiological pH in the range of 7, at room temperature. For example,a buffer comprising 20 mM Tris-HCl, pH 7.4, 137 mM NaCl, 10% glycerol,1% Triton X-100, 0.1% SDS, 0.5% deoxycholate and 2 mM EDTA could beused.

Methods for preparing the components of complexes of the invention byexpressing nucleic acid encoding proteins are described herein. Methodswhich are well known to those skilled in the art can be used toconstruct expression vectors containing protein coding sequences andappropriate transcriptional and translational control signals. Thesemethods include, for example, in vitro recombinant DNA techniques,synthetic techniques and in vivo recombination/genetic recombination.DNA and RNA synthesis may, additionally, be performed using an automatedsynthesizers. See, for example, the techniques described in Maniatis etal., Molecular Cloning: A Laboratory Manual, Cold Spring HarborLaboratory, N.Y. (1989), and in Ausubel et al., Current Protocols inMolecular Biology, Greene Publishing Associates and Wiley Interscience,N.Y. (1989).

A variety of host-expression vector systems may be utilized to expressthe coding sequences of the components of the complexes of theinvention. Such host-expression systems represent vehicles by which thecoding sequences of interest may be produced, but also represent cellswhich may, when transformed or transfected with the appropriatenucleotide coding sequences, exhibit the protein complexes of theinvention. These include but are not limited to microorganisms such asbacteria (e.g., E.coli, B. subtilis) transformed with recombinantbacteriophage DNA, plasmid DNA or cosmid DNA expression vectorscontaining protein coding sequences; yeast (e.g., Saccharomyces andPichia) transformed with recombinant yeast expression vectors containingthe protein coding sequences; insect cell systems infected withrecombinant virus expression vectors (e.g., baculovirus) containing theprotein coding sequences; plant cell systems infected with recombinantvirus expression vectors (e.g., cauliflower mosaic virus, CaMV; tobaccomosaic virus, TMV) or transformed with recombinant plasmid expressionvectors (e.g., Ti plasmid) containing the protein coding sequencescoding sequence; or mammalian cell systems (e.g., COS, CHO, BHK, 293,3T3) harboring recombinant expression constructs containing promotersderived from the genome of mammalian cells (e.g., metallothioneinpromoter) or from mammalian viruses (e.g., the adenovirus late promoter;the vaccinia virus 7.5K promoter).

In bacterial systems a number of expression vectors may beadvantageously selected depending upon the use intended for the complexbeing expressed. For example, when large quantities of complex proteinsare to be produced for the generation of antibodies or to screen peptidelibraries, vectors which direct the expression of high levels of fusionprotein products that are readily purified may be desirable. Suchvectors include but are not limited to the E. coli expression vectorpUR278 (Ruther et al., EMBO J. 2:1791, 1983), in which the proteincoding sequence may be ligated individually into the vector in framewith the lac Z coding region so that a fusion protein is produced; pINvectors (Inouye and Inouye, Nucleic acids Res. 13:3101-3109, 1985; VanHeeke & Schuster, J. Biol. Chem. 264:5503-5509, 1989); and the like.pGEX vectors may also be used to express foreign polypeptides as fusionproteins with glutathione S-transferase (GST). In general, such fusionproteins are soluble and can easily be purified from lysed cells byadsorption to glutathione-agarose beads followed by elution in thepresence of free glutathione. The pGEX vectors are designed to includethrombin or factor Xa protease cleavage sites so that the cloned proteincan be released from the GST moiety.

In an insect system, Autographa californica nuclear polyhedrosis virus(AcNPV) is used as a vector to express foreign genes. The virus grows inSpodoptera frugiperda cells. The complex coding sequence may be clonedindividually into non-essential regions (for example the polyhedringene) of the virus and placed under control of an AcNPV promoter (forexample the polyhedrin promoter). Successful insertion of thePTK/adaptor complex coding sequence will result in inactivation of thepolyhedrin gene and production of non-occluded recombinant virus (i.e.,virus lacking the proteinaceous coat coded for by the polyhedrin gene).These recombinant viruses are then used to infect Spodoptera frugiperdacells in which the inserted gene is expressed (e.g., see Smith et al.,J. Biol. 46:584, 1983; Smith, U.S. Pat. No. 4,215,051).

In mammalian host cells, a number of viral based expression systems maybe utilized. In cases where an adenovirus is used as an expressionvector, the complex coding sequence may be ligated to an adenovirustranscription/translation control complex, e.g., the late promoter andtripartite leader sequence. This chimeric gene may then be inserted inthe adenovirus genome by in vitro or in vivo recombination. Insertion ina non-essential region of the viral genome (e.g., region E1 or E3) willresult in a recombinant virus that is viable and capable of expressingproteins in infected hosts. (E.g., See Logan & Shenk, Proc. Natl. Acad.Sci. USA 81:3655-3659, 1984) Specific initiation signals may also berequired for efficient translation of inserted coding sequences. Thesesignals include the ATG initiation codon and adjacent sequences.

In cases where an entire protein gene, including its own initiationcodon and adjacent sequences, is inserted into the appropriateexpression vector, no additional translational control signals may beneeded. However, in cases where only a portion of the coding sequence isinserted, exogenous translational control signals, including the ATGinitiation codon, must be provided. Furthermore, the initiation codonmust be in phase with the reading frame of the desired coding sequenceto ensure translation of the entire insert. These exogenoustranslational control signals and initiation codons can be of a varietyof origins, both natural and synthetic. The efficiency of expression maybe enhanced by the inclusion of appropriate transcription enhancerelements, transcription terminators, etc. (see Bittner et al., Methodsin Enzymol. 153:516-544, 1987)

In addition, a host cell strain may be chosen which modulates theexpression of the inserted sequences, or modifies and processes the geneproduct in the specific fashion desired. Such modifications (e.g.,glycosylation) and processing (e.g., cleavage) of protein products maybe important for the function of the protein. Different host cells havecharacteristic and specific mechanisms for the post-translationalprocessing and modification of proteins. Appropriate cells lines or hostsystems can be chosen to ensure the correct modification and processingof the foreign protein expressed. To this end, eukaryotic host cellswhich possess the cellular machinery for proper processing of theprimary transcript, glycosylation, and phosphorylation of the geneproduct may be used. Such mammalian host cells include but are notlimited to CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, etc.

For long-term, high-yield production of recombinant proteins, stableexpression is preferred. For example, cell lines which stably coexpressboth the proteins may be engineered. Rather than using expressionvectors which contain viral origins of replication, host cells can betransformed with the protein encoding DNA independently or coordinatelycontrolled by appropriate expression control elements (e.g., promoter,enhancer, sequences, transcription terminators, polyadenylation sites,etc.), and a selectable marker.

Following the introduction of foreign DNA, engineered cells may beallowed to grow for 1-2 days in an enriched media, and then are switchedto a selective media. The selectable marker in the recombinant plasmidconfers resistance to the selection and allows cells to stably integratethe plasmid into their chromosomes and grow to form foci which in turncan be cloned and expanded into cell lines. This method mayadvantageously be used to engineer cell lines which coexpress both thePTK and adaptor protein. Such engineered cell lines are particularlyuseful in screening and evaluation of compounds that affect signalsmediated by the complexes.

A number of selection systems may be used, including but not limited tothe herpes simplex virus thymidine kinase (Wigler et al., Cell 11:223,1977), hypoxanthine-guanine phosphoribosyltransferase (Szybalska &Szybalski, Proc. Natl. Acad. Sci. USA 48:2026, 1962), and adeninephosphoribosyltransferase (Lowy et al., Cell 22:817, 1980) genes can beemployed in tk⁻, hgprt⁻ or aprt⁻ cells, respectively. Also,antimetabolite resistance can be used as the basis of selection fordhfr, which confers resistance to methotrexate (Wigler et al., Natl.Acad. Sci. USA 77:3567, 1980; O'Hare et al., Proc. Natl. Acad. Sci. USA78:1527, 1981); gpt, which confers resistance to mycophenolic acid(Mulligan & Berg, Proc. Natl. Acad. Sci. USA 78:2072, 1981); neo, whichconfers resistance to the aminoglycoside G-418 (Colberre-Garapin et al.,J. Mol. Biol. 150:1, 1981); and hygro, which confers resistance tohygromycin (Santerre et al. Gene 30:147, 1984) genes.

New members of the protein families capable of forming the complexes ofthe invention may be identified and isolated by molecular biologicaltechniques well known in the art. For example, a previously unknownprotein encoding gene may be isolated by performing a polymerase chainreaction (PCR) using two degenerate oligonucleotide primer poolsdesigned on the basis of highly conserved sequences within domainscommon to members of the protein family.

The template for the reaction may be cDNA obtained by reversetranscription of mRNA prepared from cell lines or tissue known toexpress complexes. The PCR product may be subcloned and sequenced toinsure that the amplified sequences represent the sequences of a memberof the PTK or adaptor subfamily. The PCR fragment may then be used toisolate a full length protein cDNA clone by radioactively labeling theamplified fragment and screening a bacteriophage cDNA library.Alternatively, the labeled fragment may be used to screen a genomiclibrary. For a review of cloning strategies which may be used. See e.g.,Maniatis, Molecular Cloning: A Laboratory Manual, Cold Springs HarborPress, N.Y. (1989); and Ausubel et al., Current Protocols in MolecularBiology, Green Publishing Associates and Wiley Interscience, N.Y.(1989). A general method for cloning previously unknown proteins hasbeen described by Skolnik (Skolnik, E. Y., Cell 65:75, 1991) and Skolniket al., (U.S. patent application Ser. No. 07/643,237) which areincorporated herein by reference, in their entirety, including drawings.

XXI. Derivatives of Complexes

Also provided herein are functional derivatives of a complex. By“functional derivative” is meant a “chemical derivative,” “fragment,”“variant,” “chimera,” or “hybrid” of the complex, which terms aredefined below. A functional derivative retains at least a portion of thefunction of the protein, for example reactivity with an antibodyspecific for the complex, enzymatic activity or binding activitymediated through noncatalytic domains, which permits its utility inaccordance with the present invention.

A “chemical derivative” of the complex contains additional chemicalmoieties not normally a part of the protein. Covalent modifications ofthe protein complex or peptides are included within the scope of thisinvention. Such modifications may be introduced into the molecule byreacting targeted amino acid residues of the peptide with an organicderivatizing agent that is capable of reacting with selected side chainsor terminal residues, as described below.

Cysteinyl residues most commonly are reacted with alpha-haloacetates(and corresponding amines), such as chloroacetic acid orchloroacetamide, to give carboxymethyl or carboxyamidomethylderivatives. Cysteinyl residues also are derivatized by reaction withbromotrifluoroacetone, chloroacetyl phosphate, N-alkylmaleimides,3-nitro-2-pyridyl disulfide, methyl 2-pyridyl disulfide,p-chloromercuribenzoate, 2-chloromercuri-4-nitrophenol, orchloro-7-nitrobenzo-2-oxa-1,3-diazole.

Histidyl residues are derivatized by reaction with diethylprocarbonateat pH 5.5-7.0 because this agent is relatively specific for the histidylside chain. Parabromophenacyl bromide also is useful; the reaction ispreferably performed in 0.1 M sodium cacodylate at pH 6.0.

Lysinyl and amino terminal residues are reacted with succinic or othercarboxylic acid anhydrides. Derivatization with these agents has theeffect or reversing the charge of the lysinyl residues. Other suitablereagents for derivatizing primary amine containing residues includeimidoesters such as methyl picolinimidate; pyridoxal phosphate;pyridoxal; chloroborohydride; trinitrobenzenesulfonic acid;O-methylisourea; 2,4 pentanedione; and transaminase-catalyzed reactionwith glyoxylate.

Arginyl residues are modified by reaction with one or severalconventional reagents, among them phenylglyoxal, 2,3-butanedione,1,2-cyclohexanedione, and ninhydrin. Derivatization of arginine residuesrequires that the reaction be performed in alkaline conditions becauseof the high pK_(a) of the guanidine functional group. Furthermore, thesereagents may react with the groups of lysine as well as the argininealpha-amino group.

Tyrosyl residues are well-known targets of modification for introductionof spectral labels by reaction with aromatic diazonium compounds ortetranitromethane. Most commonly, N-acetylimidizol and tetranitromethaneare used to form O-acetyl tyrosyl species and 3-nitro derivatives,respectively.

Carboxyl side groups (aspartyl or glutamyl) are selectively modified byreaction carbodiimide (R′—N—C—N—R′) such as1-cyclohexyl-3-(2-morpholinyl(4-ethyl) carbodiimide or1-ethyl-3-(4-azonia-4,4-dimethylpentyl) carbodiimide. Furthermore,aspartyl and glutamyl residue are converted to asparaginyl andglutaminyl residues by reaction with ammonium ions.

Glutaminyl and asparaginyl residues are frequently deamidated to thecorresponding glutamyl and aspartyl residues. Alternatively, theseresidues are deamidated under mildly acidic conditions. Either form ofthese residues falls within the scope of this invention.

Derivatization with bifunctional agents is useful, for example, forcross-linking the component peptides of the complexes to each other orthe complex to a water-insoluble support matrix or to othermacromolecular carriers. Commonly used cross-linking agents include, forexample, 1,1-bis(diazoacetyl)-2-phenylethane, glutaraldehyde,N-hydroxysuccinimide esters, for example, esters with 4-azidosalicylicacid, homobifunctional imidoesters, including disuccinimidyl esters suchas 3,3′-dithiobis-(succinimidylpropionate), and bifunctional maleimidessuch as bis-N-maleimido-1,8-octane. Derivatizing agents such asmethyl-3-[p-azidophenyl) dithiolpropioimidate yield photoactivatableintermediates that are capable of forming crosslinks in the presence oflight. Alternatively, reactive water-insoluble matrices such as cyanogenbromide-activated carbohydrates and the reactive substrates described inU.S. Pat. Nos. 3,969,287; 3,691,016; 4,195,128; 4,247,642; 4,229,537;and 4,330,440 are employed for protein immobilization.

Other modifications include hydroxylation of proline and lysine,phosphorylation of hydroxyl groups of seryl or threonyl residues,methylation of the alpha-amino groups of lysine, arginine, and histidineside chains (Creighton, T. E., Proteins: Structure and MolecularProperties, W. H. Freeman & Co., San Francisco, pp. 79-86 (1983)),acetylation of the N-terminal amine, and, in some instances, amidationof the C-terminal carboxyl groups.

Such derivatized moieties may improve the stability, solubility,absorption, biological half life, and the like. The moieties mayalternatively eliminate or attenuate any undesirable side effect of theprotein complex and the like. Moieties capable of mediating such effectsare disclosed, for example, in Reminqton's Pharmaceutical Sciences, 18thed., Mack Publishing Co., Easton, Pa. (1990).

The term “fragment” is used to indicate a polypeptide derived from theamino acid sequence of the proteins, of the complexes having a lengthless than the full-length polypeptide from which it has been derived.Such a fragment may, for example, be produced by proteolytic cleavage ofthe full-length protein. Preferably, the fragment is obtainedrecombinantly by appropriately modifying the DNA sequence encoding theproteins to delete one or more amino acids at one or more sites of theC-terminus, N-terminus, and/or within the native sequence. Fragments ofa protein, when present in a complex resembling the naturally occurringcomplex, are useful for screening for compounds that act to modulatesignal transduction, as described below. It is understood that suchfragments, when present in a complex may retain one or morecharacterizing portions of the native complex. Examples of such retainedcharacteristics include: catalytic activity; substrate specificity;interaction with other molecules in the intact cell; regulatoryfunctions; or binding with an antibody specific for the native complex,or an epitope thereof.

Another functional derivative intended to be within the scope of thepresent invention is a complex comprising at least one “variant”polypeptide which either lack one or more amino acids or containadditional or substituted amino acids relative to the nativepolypeptide. The variant may be derived from a naturally occurringcomplex component by appropriately modifying the protein DNA codingsequence to add, remove, and/or to modify codons for one or more aminoacids at one or more sites of the C-terminus, N-terminus, and/or withinthe native sequence. It is understood that such variants having added,substituted and/or additional amino acids retain one or morecharacterizing portions of the native complex, as described above.

A functional derivative of complexes comprising proteins with deleted,inserted and/or substituted amino acid residues may be prepared usingstandard techniques well-known to those of ordinary skill in the art.For example, the modified components of the functional derivatives maybe produced using site-directed mutagenesis techniques (as exemplifiedby Adelman et al., 1983, DNA 2:183) wherein nucleotides in the DNAcoding the sequence are modified such that a modified coding sequence ismodified, and thereafter expressing this recombinant DNA in aprokaryotic or eukaryotic host cell, using techniques such as thosedescribed above. Alternatively, components of functional derivatives ofcomplexes with amino acid deletions, insertions and/or substitutions maybe conveniently prepared by direct chemical synthesis, using methodswell-known in the art. The functional derivatives of the complexestypically exhibit the same qualitative biological activity as the nativecomplexes.

Other functional derivatives include mutant, species and allelicvariations.

By “mutant variation” is meant a nucleic acid or amino acid moleculethat results from any detectable change in the genetic material whichmay be transmitted to daughter cells giving rise to mutant cells,including nucleic acids or polypeptides having nucleotides or aminoacids that are added, deleted, substituted for, inverted, or transposedto new positions with and without inversion. The mutant variation mayoccur spontaneously or may be induced experimentally by application ofmutagens and may result from any (or a combination of) detectable,unnatural change affecting the chemical or physical constitution,mutability, replication, phenotypic function, or recombination of one ormore deoxyribonucleotides.

By “species variation” is meant a change in the nucleic acid or aminoacid sequence that occurs among species and may be determined by DNAsequencing of the molecule in question.

By “allelic variation” is meant an alternative functional derivation ofthe typical form of a gene in an organism occupying a given locus on achromosome.

XXII. Evaluation of Disorders

The protein complexes of the invention involved in disorders may beutilized in developing a prognostic evaluation of the condition of apatient suspected of exhibiting such a disorder. For example, biologicalsamples obtained from patients suspected of exhibiting a disorderinvolving a protein complex may be assayed for the presence of suchcomplexes. If such a protein complex is normally present, and thedevelopment of the disorder is caused by an abnormal quantity of thecomplex, the assay should compare complex levels in the biologicalsample to the range expected in normal tissue of the same cell type.

Among the assays which may be undertaken may include, but are notlimited to isolation of the protein complex of interest from thebiological sample, or assaying for the presence of the complex byexposing the sample to an antibody specific for the complex, butnon-reactive to any single, non-complexed component, and detectingwhether antibody has specifically bound.

Alternatively, one or more of the components of the protein complex maybe present in an abnormal level or in a modified form, relative to thelevel or form expected is normal, nononcogenic tissue of the same celltype. It is possible that overexpression of both components may indicatea particularly aggressive disorder. Thus, an assessment of theindividual and levels of mRNA and protein in diseased tissue cells mayprovide valuable clues as to the course of action to be undertaken intreatment of such a disorder. Assays of this type are well known tothose of skill in the art, and may include, but are not limited to,Northern blot analysis, RNAse protection assays, and PCR for determiningmRNA levels. Assays determining protein levels are also well known tothose of skill in the art, and may include, but are not limited to,Western blot analysis, immunoprecipitation, and ELISA analysis. Each ofthese techniques may also reveal potential differences in the form(e.g., the primary, secondary, or tertiary amino acid sequence, and/orpost-translational modifications of the sequence) of the component(s).

EXAMPLES

The examples below are non-limiting and are merely representative ofvarious aspects and features of the present invention. The examplesbelow demonstrate the isolation of MDK1, analysis of MDK1 Northernblots, analysis of MDK1 by in situ hybridization, and expression of MDK1in human 293 fibroblasts.

cDNA Cloning of MDK1

PCR reactions were performed on cDNA prepared with poly(A⁺) RNA from13.5 day old mouse embryos using degenerate oligonucleotide primersaccording to Lai and Lemke (Lai and Lemke, Neuron 6:691-704, 1991).These primers correspond to the conserved peptide motifs HRDLAA (SEQ.I.D. NO. 9) and D(V/M)WS(F/Y)G (SEQ. I.D. NO. 10) of the kinasecatalytic domain. One of the isolated PCR fragments coding for 68 aminoacids of the catalytic domain of MDK1 was subsequently used in cDNAlibrary screening of a mouse embryo cDNA library (11.5 day embryo,Clontech) and a mouse adult brain library (Clontech).

DNA sequencing was performed according to the dideoxynucleotide chaintermination method (Sanger et al., Proc. Natl. Acad. Sci. USA74:5463-5467, 1977) using sequenase enzyme, reagents and protocolssupplied by United States Biochemical Corporation. Ambiguous sequenceswere sequenced using the dITP extension of the sequenase sequencing kit.Comparisons of inferred MDK1 protein sequence with various sequencedatabases were done using the TFASTA program (Pearson and Lipman, Proc.Natl. Acad. Sci. USA 85:2444-2448, 1988) provided with the GCG sequenceanalysis software package (Genetics Computer Group, Wisconsin).

RNA Extraction and Northern Analysis

Balb/c mice were mated overnight and the morning of vaginal plugdetection was defined as 0.5 day of gestation. For Northern blotanalysis, RNA was extracted from the frozen embryos according to theacidic phenol method of Chomczynski and Sacchi (Chomczynski and Sacchi,Analytical Biochemistry 162:156-159, 1987). Oligo(dT)columnchromatography was used for the selection of poly(A⁺) RNA. Aliquots wereelectrophoresed in 1.2% agarose formaldehyde (Sambrook et al., Molecularcloning—A Laboratory Manual. 2nd ed., Cold Spring Harbor LaboratoryPress, New York, 1989) gels and transferred to nitrocellulose membranes.Hybridizations were performed overnight in 50% formamide, 5×SSC (750 mMsodium chloride, 75 mM sodium citrate), 5×Denhardt's (0.1% Ficoll 400,0.1% polyvinylpyrollidone, 0.1% BSA) and 50 mM NaPO₄ (pH6.8) at 42° C.with 1-3×10⁶ cpm/ml of ³²P-random primed DNA probe, followed by highstringency washes in 0.2×SSC, 0.2% SDS at 50° C. The filters wereexposed for 14 days. For repeated use, filters were stripped of anylabeled DNA by incubation in destillated water at 95° C. for 10 minutes.

Preparation of Antisera

The C-terminal 110 amino acids of MDK1 were subcloned into the fusionprotein expression vector pGEX1 (Smith and Johnson, Gene 67:31-40, 1988;Pharmacia). The fusion protein was purified as described and used forimmunizing rabbits.

Transient Expression of MDK1 in 293 Cells

Transfection of human embryonic kidney fibro-blast 293 cells (ATCC CRL1573) was performed as described by Chen and Okayama (Chen and Okayama,Mol. Cell. Biol. 7:2745-2752, 1987). The cDNA encoding MDK1, MDK1Δ1 orMDK1Δ2 was subcloned into an expression vector under the control of theimmediate early cytomegalovirus promoter. CsCl-purified DNA was used fortransfections. Treatment and lysis of the cells were performed asdescribed by Lammers et al. (Lammers et al., J. Biol. Chem.268:22456-22462, 1993).

For immunoprecipitations, Protein-A sepharose and either 2 μl of theantiserum against the C-term of MDK1 or 5 μg of the mouse monoclonalanti-phosphotyrosine antibody 5E.2 (Fendly et al., Cancer Research50:1550-1558, 1990) were added to the cell lysate and incubated at 4° C.for 3 hours. The immunoprecipitates were washed with HNTG buffer (20 mMHEPES, pH 7.5, 150 mM NaCl, 10% glycerol, 0.1% Triton X-100, 10 mM NaF,1 mM sodium orthovanadate), separated on a 7.5% SDS polyacrylamide gel,dried on a vacuum dryer and exposed for 1 day. For deglycosylation ofMDK1, the immunoprecipitated proteins were denatured in DenaturingBuffer and incubated with 1,000 units of PNGase F according to themanufacturer's instructions (New England Biolabs).

Preparation of Probes

For Northern blot hybridization, the indicated fragments were isolatedand prepared by labeling with α-(³²P)dATP (Amersham) by randomhexanucleotide priming (USB, Feinberg and Vogelstein, Anal. Biochem.132:6-13, 1983). A 5′-located 1,565 bp fragment corresponding to aminoacids 18 to 538 (nucleotides 282 to 1,847) of the extracellular domainof MDK1 was subcloned into the pBluescript vector (Stratagene) for useas a probe for the in situ hybridization. Single-stranded DNA probeswere prepared as described previously (Millauer et al., Cell 72:835-846,1993). Briefly, RNA transcripts were synthesized from the linearizedplasmid using T3- or T7 RNA-Polymerase (Boehringer) and the DNA wasdegraded using DNase (RNase-free preparation, Boehringer). The RNAtranscripts were used for a random-primed cDNA synthesis withα-(³⁵S)DATP (Amersham) by reverse transcription with MMLV reversetranscriptase (BRL), giving small cDNA transcripts of about 100 bp.After hydrolyzation of the RNA the probe was purified with asephadex-G50 column.

In situ Hybridization

For in situ hybridization, the embryos or the isolated brain of8-week-old female mice were embedded in Tissue-Tek (Miles), frozen onthe surface of liquid nitrogen and stored at −70° C. prior to use.Sectioning, postfixation and hybridization was performed as describedpreviously (Millauer et al., Cell 72:835-846, 1993). 10 μm-thicksections were incubated overnight with the ³⁵S-cDNA probe (finalconcentration 2×10⁴ cpm/μl) at 52° C. in a buffer containing 50%formamide, 300 mM NaCl, 10 mM Tris-HCl, 10 mM NaPO₄ (pH6.8), 5 mM EDTA,0.02% Ficoll 400, 0.02% polyvinylpyrolidone, 0.02% BSA, 10 mg/ml yeastRNA, 10% dextran sulfate, and 10 mM DTT. Posthybridization washing wasperformed at high stringency (50% formamide, 300 mM NaCl, 10 mMTris-HCl, 10 mM NaPO₄ (pH6.8), 5 mM EDTA, 10 mM DTT at 52° C.). Slideswere coated with Kodak NTB2 film emulsion and exposed for seven days.After developing, the sections were counterstained with toluidine blue.All sections were analyzed with an Olympus BH2 or SZ-PT microscope usingbrightfield and darkfield illumination.

Example 1

Isolation of MDK1

Using a PCR-based approach (Wilks et al., Gene 85:67-74, 1989; Lai andLemke, Neuron 6:691-704, 1991) with degenerate oligonucleotide primersspecific for conserved motifs of receptor tyrosine kinases, we attemptedto identify new RTKs displaying developmental regulation. Out ofapproximately 600 clones obtained through PCR on cDNA from mouse embryosof the stages 9.5 p.c., 11.5 p.c., 13.5 p.c., 15.5 p.c., 17.5 p.c., and19.5 p.c., we identified three short cDNA sequences coding forsubdomains VI to IX of the kinase domain (Hanks et al., 1988) ofpreviously unidentified RTKs.

Using the PCR-amplified MDKL fragment for screening of an 11.5 day mouseembryo and an adult mouse brain cDNA library, we isolated fifteenindependent phage clones, which were sequenced completely. Six clonesproved to include the full-length sequence of MDK1. The nucleotide andinferred amino acid sequence is shown in FIG. 1, in which the predictedinitiating methionine and the signal peptide according to Kozak (Kozak,M., Nucleic Acids Res. 12:857-872, 1984) and Heijne (Heijne, G. v.,Nucleic Acids Res. 14:4683-4690, 1986) are indicated.

MDK1 is a member of the eck/eph family of RTKs, as evidenced by thecomplete conservation of cysteine residues in the ectodomain and thepresence of tandem fibronectin type III (FN III) homology domains (FIG.1; Pasquale, E. B., Cell Regula. 2:523-534, 1991). Comparison of theectodomain sequences revealed 61% amino acid identity of MDK1 to Ehk-1and 59% to Mek4 and Sek, but only 41% to Eph. Comparison of theintracellular domains revealed 76% amino acid identity of MDK1 to Ehk-2,75% to Sek, and 55% to Eph.

One clone was found to contain an alternative 3′-untranslated region. Inthis clone, the polyadenylation signal appeared to be bypassed,resulting in a longer MDK1 transcript (FIG. 1). Four sequence variantsof MDK1 were identified: three independent clones each were identifiedencoding most of the open reading frame of MDK1 but lacking five(MDK1Δ1) or four (MDK1Δ2) amino acids (SEQ ID NOS: 11 and 12respectively) (see FIG. 2B). In MDK1Δ2, the codon of one amino acid 5′of the missing nucleotide stretch was changed, resulting in an aminoacid alteration from phenylalanine to cysteine.

In addition, we isolated two clones encoding different truncated formsof MDK1, designated MDK1.T1 and MDK1.T2 (SEQ ID NOS: 3 and 5,respectively) (FIG. 2A), whose deduced amino acid sequences comprisedthe extracellular and transmembrane domains of MDK1 but contained onlytwenty amino acids of the MDK1 juxtamembrane domain. Their C-termini endwith eleven (MDK1.T1) or twenty-seven (MDK1.T2) unique amino acids.MDK1.T1, MDK1.T2, and MDK1Δ2 exhibit a variation of the MDK1 sequence atthe same nucleotide position (base 2031), indicating the location of anexon/intron border. A schematic representation of the various forms ofMDK1 found by screening the mouse adult brain cDNA library is shown inFIG. 2B.

Example 2

Northern Blot Analysis

Northern blot analysis was performed with different nucleotide probes ofthe MDK1 and MDK1.T1 cDNA (FIG. 3). FIG. 3 is a Northern blot analysisof MDK1 mRNA with various MDK1 probes. Northern blot analysis of MDK1mRNA with various MDK1 probes. 4 μg of poly(A⁺) RNA isolated from 13.5day mouse embryos were analyzed. Sizes were determined by using theresidual 28S and 18S ribosomal RNAs as internal markers and areindicated by arrowheads. FIG. 2B is a schematic representation of theorigins of the probes. The probes correspond to nucleotides 282-1847(A), 1847-2082 (B), 2029-2900 (C) and 3758-4304 (D) of MDK1 (as setforth in SEQ ID No:1) and nucleotides 2044-2666 (E) of MDK1.T1 (as setforth in SEQ ID No:3). Using the extracellular domain of MDK1 as aprobe, we identified five transcripts of 6.8, 5.7, 4.0, 3.2, and 2.6 kbin poly(A⁺) RNA from mouse embryo day 13.5 p.c. The two smallesttranscripts were also detected with a probe corresponding to thetransmembrane domain, but not with a probe corresponding to theintracellular domain of MDK1, confirming the existence of transcriptsencoding variant forms of MDK1 lacking the intracellular kinase domainfound by the cDNA library screening. The 3.2 kb transcript correspondsto MDK1.T1, whereas the lowest band probably corresponds to MDK1.T2,since the size of 2.6 kb matches that of the CDNA of MDK1.T2. The upperthree bands of 6.8, 5.7, and 4.0 kb are predicted to encode thefull-length MDK1 forms, with the 6.8 kb and 5.7 kb transcripts resultingfrom the use of an alternative polyadenylation site.

To elucidate the developmental regulation of MDK1 transcription duringmouse embryology, we performed Northern blot analysis of poly(A⁺) RNAisolated from embryonic and postnatal stages E10.5 to E18.5 and P1 toP8, respectively.

Northern blot analysis of MDK1 expression throughout mouse developmentwas performed. 4 μg poly(A⁺) RNA of mouse embryos day 12.5 to 18.5 andof the four postnatal stages day 1, 2, 4 and 8 were fractionated on an1.2% agarose gel, blotted on nitrocellulose and hybridized with a MDK1cDNA probe corresponding to bp 282-1,847 (probe A). The positions of the28S and 18S ribosomal RNA were marked, the mRNA sizes were deduced.Rehybridization of the same blot with a glyceral-dehyde-3-phosphatedehydrogenase (GAPDH) probe (Dugaczyk et al., Biochemistry,22:1605-1611, 1983), indicated loading of equal amounts of mRNA.

Northern blot analysis of MDK1 expression in adult mouse tissues wasalso performed; 10 μg total RNA of the indicated tissues were separatedand hybridized with the same probe as above. The origins of the RNAs areas follows: adult stomach, brain, testes, heart, lung, liver, kidney,spleen and muscle. The gel was stained with ethidium bromide beforeblotting to allow for comparison of equal RNA loading.

MDK1 transcripts were readily detectable as early as embryonic day 10.5.The signal intensity observed for the full-length MDK1 transcriptsdeclined during development and was barely detectable in the postnatalday 8 mouse. The intensity of the transcripts corresponding to thetruncated forms of MDK1 remained unchanged.

In adult mouse tissues, MDK1 transcripts were detected in brain andtestes and, at lesser intensity, in spleen. In brain, we identified allfive MDK1 transcripts with a prominent signal corresponding to MDK1.T1;testes and spleen, however, showed a transcript size of 3.5 kb,indicating the existence of an additional transcript variant withtissue-specific expression.

Example 3

MDK Expression Analysis by in situ Hybridization

To investigate the temporal expression of MDK1 during murinedevelopment, we used a MDK1-specific, single-stranded antisense probecorresponding to the extracellular domain (nucleotides 282 to 1,847 ofthe MDK1 sequence), which recognizes all of the different forms of MDK1.Sagittal and horizontal sections of 12.5, 14.5, 16.5, and 18.5 dayembryo and of the adult mouse brain were examined. MDK1 expression wasdetected in a variety of neuronal tissues and appeared to be verycomplex.

MDK1 expression in 12.5 p.c. and 14.5 p.c. embryos was studied. In situhybridization was studied using a (³⁵S)-DATP labelled cDNA antisenseconstruct corresponding to the extracellular domain of MDK1 on sagittaland horizontal sections of mouse embryos. Dark-field views of sagittalsections of a 12.5 p.c. and a 14.5 p.c. embryo were obtained. Sectionshybridized with the antisense probe, and control hybridization used thesense strand. Dark-field view of a horizontal section of a 12.5 p.c.embryo showed the high expression of MDK1 in the most dorsal part of theepithalamus. Higher magnification of the MDK1 expression in themesenchyme surrounding the dorsal root ganglia was also studied.Dark-field view of a sagittal section of a 14.5 p.c. embryo showed MDK1expression in the neuroepithel of the cochlea. Sections that hybridizedwith the antisense probe include: trigeminal (V) ganglion; inferiorolive; lung; dorsal root ganglia.

MDK1 expression in the 16.5 p.c. mouse embryo was also studied.Dark-field and light-field view of sections of a 16.5 p.c. mouse embryo.All sections were hybridized with the MDK1 antisense probe except for acontrol hybridization. Sagittal section of the head of the embryo andthe mouth and the kidney were obtained. MDK1 expression in the superiorcolliculus (sagittal section) and the subcommisural organ (horizontalsection) were observed. Dark-field and light-field views of thetrigeminal ganglion were taken. Dark-field and light-field view of asagittal section through a hindlimb were also taken.

MDK1 expression in the 18.5 p.c. embryo and in the adult central nervoussystem was also studied. Dark-field view of sections of an 18.5 p.c.embryo and adult brain. Sagittal sections hybridized with the antisenseor the sense probe of MDK1. Horizontal section hybridized with theantisense probe showing expression in the inferior olive. Sagittalsections probed for MDK1 expression showing strong transcription in thehabenula of the epithalamus and in the mammillary body and the pons.Antisense and control hybridization on a sagittal section of the adultbrain. MDK1 expression in the cerebellum was studied. We noted limitedexpression of MDK1 in the Purkinje cell layer. Horizontal section of theadult brain hybridized with the antisense probe for MDK1. A highermagnification of the subcommisural organ was also observed.

In the central nervous system (CNS), expression of MDK1 persisted duringthe developmental stages analyzed in the epithalamus, the thalamus, themammillary body of the hypothalamus, in the developing hippocampus, inthe dentate gyrus, and at a low level in the caudate-putamen. MDK1transcripts were detected throughout the development of the cerebellum,with transcripts in the alar plate of the metencephalon of the 12.5 dayembryo and in the cerebellar primordium of the 14.5 day embryo. MDK1 wasalso found in the tectum, in later stages of development restricted tothe superior colliculus. In addition, we observed expression in thefrontal and cingulate cortex of the 12.5 day embryo and, starting at16.5 p.c., in the neopallial cortex and in the ventricular zone of thetelencephalon. The embryonic stage E18.5 also displayed MDK1 transcriptsin the pyriform cortex. MDK1 expression was found in the pons and themedulla, with a strong signal in the inferior olive. Starting atembryonic stage E14.5, the subcommisural organ appeared as a site ofstrong MDK1 expression. Additional signals were detected in the striatumof the ganglionic eminence and in the septal nucleus of the 16.5 p.c.embryo.

The adult brain displayed a more restricted pattern of MDK1transcription. Although there appeared to be a very faint expression ofMDK1 in the whole brain, a slightly higher transcription rate wasdetected in the nucleus caudatus-putamen and in the cortex, especiallyin the granular cortex and the pyriform cortex. Regions of higher MDK1expression include the habenula, the mammillary nucleus, the anteriorolfactory nucleus and the Purkinje cell layer of the cerebellum. Inaddition, MDK1 was strongly expressed in the pyramidal cell layer of thehippocampus (CA1, CA2, CA3) as well as in the neurons of the dentategyrus and in the subcommisural organ. These areas of strong MDK1expression in the adult brain were detected in all stages of the CNSdevelopment.

In the peripheral nervous system, high levels of MDK1 expression weremaintained in the trigeminal (V) ganglion and the vestibulocochlear(VIII) ganglion. MDK1 transcripts were abundant in the neuroepitheliumof the cochlea.

Particularly in early stages of murine development, MDK1 displayed avariety of expression sites outside the nervous system, which decreasedin later stages of development both in intensity and diversity. MDK1transcripts were found in the kidney and lung, and were restricted toglomeruli and the central mesenchyme, and the segmental bronchi,terminal bronchi and bronchioles, respectively. Strong expression ofMDK1 in the limb buds appeared to be restricted to the blastemacondensations and the cartilage primordia of the limb buds and, atembryonic day 16.5, to the connective tissue surrounding the jointsbetween the metacarpal or the metatarsal and the phalanges bones.Prominent areas of MDK1 transcription were found in the developing lips,the eyelids, and the tongue, and to a lesser degree in the heart and inthe epithelial lining of the liver.

At embryonic stage E12.5, a very strong signal was identified inmesenchymal cells surrounding the spinal cord and the brain, which wasreduced through development and undetectable at embryonic day 18.5. Wealso observed strong expression of MDK1 in the primitive nasal cavityand the nasal septum, in the lateral palatine process and in themaxillary process, and, in addition, a weak signal in the precartilageprimordium of the ribs of the 12.5 p.c. embryo. Later in development,new expression sites found in the 14.5 day embryo included the primordiaof follicules of fibrissae and the primordia of the incisor and themolar teeth, with low levels of transcription detected in the intestine,at the pinna of the ear, and in the mesenchyme of the nasal capsule andaround the oesophagus and the trachea. The embryonic day 16.5 exhibitedMDK1 expression in different glands, including the submandibular gland,the mucous palatine glands, the serous glands of the nasal septum and,in the 18.5 day embryo, in the parotid gland. At this stage, there wasno expression of MDK1 in the heart, intestine or in the epitheliallining of the liver, a reduced expression level in the lung and thetongue, and predominant expression in the brain.

Example 4

Expression of MDK1 in Human 293 Fibroblasts

To investigate the tyrosine kinase activity of MDK1, we used the human293 cell transient expression system (Lammers et al., J. Biol. Chem.268:22456-22462, 1993). The different cDNAswere subcloned into acytomegalovirus promoter-based expression vector and transfected intosubconfluent 293 monolayers. Subsequently, the proteins were labelledmetabolically by incubating the cells with [³⁵S]methionine.

Immunoprecipitation of (³⁵S) methionine-labeled MDK1 from 293 cells wasperformed. 293 cells transiently transfected with MDK1, MDK1Δ1 andMDK1Δ2 expression plasmids were labeled metabolically overnight with(³⁵S) methionine. The cells were lysed and the proteins wereimmunoprecipitated with the indicated antibodies. Separation of theimmunoprecipitated proteins was done by SDS-PAGE. Autoradiographs of thegels were obtained and the molecular weights in kDa of the markers weredetermined. Untransfected 293 cells were used as a control.

Immunoprecipitation with antiserum directed against the C-term of MDK1was performed. The MDK1 variants were immunoprecipitated as above buttreated with 1,000 units of PNGase F prior SDS-PAGE as indicated toremove all asparagine linked carbohydrates. The untreated MDK1 proteinwas an additional control. Immunoprecipitation with anti-phosphotyrosineantibody 5E.2. was also performed. Cells were treated with 1 mM sodiumorthovanadate (NaOVan) to inhibit cellular protein tyrosine phosphatases(+) for 90 min before lysis. The receptor variants MDK1, MDK1Δ1, andMDK1Δ2 were immunoprecipitated using a polyclonal antibody directedagainst the carboxy-terminal 110 amino acids of MDK1. Autoradiography ofthe protein gel shows a doublet of apparent molecular weight of 120 kDand 114kD. PNGase F treatment and removal of asparagine-linkedcarbohydrates from immunoprecipitated, [³⁵S]methionine-labelled MDK1indicated that the 120 kD band corresponded to the glycosylated MDK1,which has a calculated molecular weight of 112 kD.

To investigate the ability of MDK1 to auto-phosphorylate, we treatedcells that had been transfected with appropriate vector constructs for90 min with 1 mM sodium orthovanadate prior to lysis. The[³⁵S]methionine-labelled proteins were immunoprecipitated with theantiphosphotyrosine antibody 5E.2 and resolved by SDS-PAGE. Theresulting autoradiograph shows a protein doublet of the expected size,indicating strong MDK1 autophosphorylation. Several less intenseadditional phosphotyrosine-containing protein bands in theimmunoprecipitates of sodium orthovanadate-treated cells may be cellularsubstrates of MDK1 because they could not be identified in untransfected293 cells.

The identity of the strongly labeled protein band of approximately 47 kDis not clear. This protein was not observed in untransfected cells, andimmunoprecipitations of MDK1 from mixed cell lysates of unlabelled cellstransfected with MDK1 with [³⁵S]methionine-labelled, untransfectedcontrol cells demonstrated that this protein was not acoimmunoprecipitated substrate of MDK1. Since its apparent molecularsize corresponds to the calculated molecular size of the entireintracellular domain of MDK1 (47 kD), we considered the possibility ofthe existence of a soluble catalytic kinase domain, either as a specificdegradation product or as part of a signal transduction mechanism. Sincethis band was also detected after solubilizing the cells with hotLaemmli buffer, we can exclude the possibility that it was generatedduring the experimental procedure.

The terms and expressions which have been employed are used as terms ofdescription and not of limitation, and there is no intention that in theuse of such terms and expressions of excluding any equivalents of thefeatures shown and described or portions thereof, but it is recognizedthat various modifications are possible within the scope of theinvention claimed. Thus it should be understood that although thepresent invention has been specifically disclosed by preferredembodiments and optional features, modification and variation of theconcepts herein disclosed may be resorted to by those skilled in theart, and that such modifications and variations are considered to bewithin the scope of this invention as defined by the appended claims.

12 4304 base pairs nucleic acid single linear nucleic 1 AAGCGGCCGGTCTGCAGTCG GAGACTTGCA GGCAGCAAAC ACGGTGCGAA 50 CGAACCGGAG GGGGGAGAGAGAAATCAAAC AGCTAAGCGT GGAGCAGACG 100 GCCTGGGACC CAGAAGGGGA TCGATGCGAGGAGCGCAATA ATAACAACAA 150 TAATAACCCA CTTCGGAGCA AACAGCATCT AAAGAGCTGCGACCCAACTG 200 CAGCCTAAAA AAATCAAACC TGCTCATGCA CCATGGTTGT TCAAACTCGG250 TTCCCTTCGT GGATTATTTT GTGTTACATC TGGCTGCTTG GCTTTGCACA 300CACGGGGGAG GCGCAGGCTG CGAAGGAAGT ACTATTACTG GACTCGAAAG 350 CACAACAAACAGAATTGGAA TGGATTTCCT CTCCACCCAG TGGGTGGGAA 400 GAAATTAGTG GTTTGGATGAGAACTACACT CCGATAAGAA CATACCAGGT 450 GTGCCAGGTC ATGGAGCCCA ACCAGAACAACTGGCTGCGG ACTAACTGGA 500 TTTCTAAAGG CAACGCACAA AGGATTTTTG TAGAATTGAAATTCACCTTG 550 AGGGATTGTA ATAGTCTTCC CGGAGTCCTG GGAACTTGCA AGGAAACGTT600 TAATTTGTAC TATTATGAAA CAGACTACGA CACCGGCAGG AATATACGAG 650AAAACCTTTA TGTTAAAATA GACACCATTG CTGCAGATGA AAGTTTCACA 700 CAAGGTGACCTTGGTGAAAG AAAGATGAAG CTGAACACTG AGGTGAGAGA 750 GATTGGACCT TTGTCCAAAAAGGGATTCTA TCTTGCCTTT CAGGATGTAG 800 GGGCTTGCAT AGCATTGGTT TCTGTCAAAGTGTACTACAA GAAGTGCTGG 850 ACCATTGTTG AGAACTTAGC TGTCTTTCCA GATACAGTGACTGGTTCGGA 900 ATTTTCCTCC TTAGTCGAGG TCCGTGGGAC ATGTGTCAGC AGTGCCGAGG950 AAGAGGCAGA AAATTCCCCC AGAATGCATT GCAGTGCAGA AGGAGAGTGG 1000CTAGTACCCA TTGGAAAATG CATCTGCAAA GCAGGCTATC AGCAAAAAGG 1050 GGACACTTGCGAACCCTGTG GCCGCAGGTT CTACAAATCT TCCTCTCAGG 1100 ATCTCCAGTG TTCTCGTTGTCCAACCCACA GCTTCTCTGA CCGAGAAGGA 1150 TCATCCAGGT GTGAATGTGA AGATGGGTACTACAGAGCTC CTTCTGATCC 1200 ACCATACGTT GCATGCACGA GGCCTCCCTC TGCACCACAGAACCTTATTT 1250 TCAATATCAA TCAAACGACT GTAAGTTTGG AATGGAGTCC TCCGGCTGAC1300 AACGGGGGAA GAAACGATGT CACCTACAGA ATACTGTGTA AGCGGTGCAG 1350TTGGGAACAG GGAGAATGTG TGCCATGCGG AAGTAACATT GGATACATGC 1400 CCCAGCAGACGGGATTAGAG GATAACTATG TCACTGTCAT GGACCTACTT 1450 GCCCATGCAA ATTACACTTTCGAAGTTGAA GCTGTAAATG GAGTTTCGGA 1500 CTTAAGCAGA TCCCAGAGGC TCTTCGCTGCTGTTAGCATC ACCACCGGTC 1550 AAGCAGCTCC CTCGCAAGTG AGTGGAGTCA TGAAGGAGCGAGTACTGCAG 1600 CGGAGTGTGC AGCTTTCCTG GCAGGAGCCG GAGCATCCCA ATGGAGTCAT1650 CACGGAATAT GAAATCAAGT ATTATGAGAA AGATCAACGG GAAAGGACGT 1700ACTCAACACT CAAAACCAAG TCCACCTCCG CCTCCATTAA TAATCTGAAA 1750 CCGGGAACAGTGTACGTCTT TCAGATCCGG GCGGTCACTG CTGCCGGTTA 1800 TGGAAACTAC AGCCCTAGGCTTGATGTTGC CACACTTGAG GAAGCTTCAG 1850 GTAAAATGTT TGAAGCGACA GCAGTCTCCAGTGAACAGAA TCCTGTCATC 1900 ATAATTGCTG TAGTGGCTGT AGCAGGGACC ATCATCTTGGTGTTCATGGT 1950 GTTCGGCTTC ATCATTGGAA GAAGGCACTG TGGTTATAGC AAGGCTGACC2000 AAGAAGGGGA TGAAGAACTC TACTTTCATT TTAAATTTCC AGGCACCAAA 2050ACCTACATTG ACCCTGAAAC CTATGAGGAC CCAAATAGAG CTGTCCATCA 2100 ATTCGCCAAGGAGCTAGATG CCTCCTGTAT TAAAATTGAG CGTGTGATTG 2150 GTGCAGGAGA ATTTGGAGAAGTTTGCAGTG GTCGTTTGAA ACTTCCGGGC 2200 CAGAGAGATG TTGCAGTGGC CATAAAAACCCTGAAAGTTG GTTACACAGA 2250 AAAGCAAAGG AGGGACTTTT TATGCGAAGC AAGCATCATGGGGCAATTTG 2300 ACCACCCAAA TGTCGTCCAT TTGGAAGGGG TTGTTACAAG AGGGAAGCCT2350 GTCATGATTG TGATAGAGTT CATGGAGAAT GGAGCCCTGG ATGCATTTCT 2400CAGGAAACAC GATGGGCAGT TTACAGTCAT TCAGTTGGTA GGAATGTTGA 2450 GAGGTATTGCCGCTGGGATG CGATACTTGG CTGATATGGG ATACGTTCAC 2500 AGGGACCTTG CAGCGCGCAACATCCTTGTC AACAGCAATC TTGTTTGTAA 2550 AGTGTCAGAT TTTGGCCTTT CCCGGGTTATAGAGGATGAT CCCGAAGCTG 2600 TCTACACCAC GACTGGTGGA AAAATTCCAG TAAGGTGGACTGCACCGGAA 2650 GCCATTCAAT ACCGGAAGTT CACCTCAGCC AGCGATGTGT GGAGCTATGG2700 GATTGTCATG TGGGAAGTGA TGTCTTATGG AGAAAGACCT TACTGGGACA 2750TGTCAAATCA AGATGTCATT AAAGCGATAG AAGAAGGTTA TCGTTTGCCG 2800 GCGCCCATGGATTGCCCAGC TGGTCTTCAC CAGCTAATGC TGGATTGTTG 2850 GCAGAAAGAT CGGGCGGAAAGGCCAAAGTT TGAGCAGATA GTCGGAATTC 2900 TAGACAAAAT GATTCGAAAC CCAAGTAGTCTGAAAACACC CCTGGGAACT 2950 TGTAGTAGAC CCTTAAGCCC TCTTCTGGAC CAGAGCACTCCTGACTTCAC 3000 TGCCTTCTGT TCAGTTGGAG AATGGTTGCA AGCTATTAAA ATGGAAAGGT3050 ATAAGGACAA CTTCACAGCA GCGGGTTACA ACTCACTCGA GTCAGTGGCC 3100AGGATGACTA TCGATGATGT GATGAGTTTA GGGATCACAC TGGTTGGCCA 3150 TCAAAAGAAGATCATGAGCA GCATCCAGAC TATGCGGGCA CAAATGTTGC 3200 ATTTACACGG AACAGGCATCCAAGTGTGAC ACATCGGCCT CCCTCAGATG 3250 AGGCTTAAGA CTGCAGGAGA ACAGTTCTGGCCTTCAGTAT ACGCATAGAA 3300 TGCTGCTAGA AGACAGTTGA TATACTGGGT CCTTCCTACAAGAAAGAGAA 3350 GATTTTAGAA GCACCTCCAG ACTTGAACTC CTAAGTGCCA CCAGAATATA3400 CAAAAAGGGA ATTTAGGATC CACCACTGGT GGCCAGGAAC ACAGCAGAGA 3450CAATAAACAA AGTACTACCT GAAAAACATC CCAACACCTT GAGCTCTCGA 3500 ACCTCCTTTTTATCTTATAG ACTTTTTAAA AATGTACATA AAGAATTTAA 3550 GAAAGAATAT ATTTGTCAAATAAAAATCAT GATCTTATTG TTAAAATCAA 3600 TGAAATATTT TCCTTAAAAT ATGTGATTTCAGACTATTCT TTTCCAGAAC 3650 CATCTGTGTT TATTCTGCTT AAGGACTTTG TTTTAGAAAGTTATTTGTAG 3700 CTTTGGACCT TTTTAGTGTT AAATTTATGA CACGTTACTA CACTGGGAAC3750 CTTTGAAGAC TCTCAAACTT AAAGGAAAGC AAAACTACGC ACATAGTCGA 3800GGATGGACTT TGTCCTTCAT GGCTTTGGTA TCCTGGCTGT GTCATTTTGT 3850 TAAACCAGTGATGTTTTCAT ATTGTTTGCT GATTGGCAGG TAGTTCAAAA 3900 TTGCAAGTTG CCAAGAGCTCTGATATTTTT TAACAGGATT TTTTTTTCTT 3950 TGTAAAAATC AGATAACATA CTAACTTTTCAATGAAAAAA AAAAAAAAAG 4000 AAGCAATAAT GATCCATAAA TACTATAAGG CACTTTTAACAGATTGTTTA 4050 TAGAGTGATT TACTAGGCAG AATTTAATAA AAAAAAAAGA GAGATGTCAA4100 ATTTTAGGTT TATGTGTATA TGATAAAAGG CTGAGCTTCG TCTGAAGATG 4150CTGGTGAAAG CAAGACTGGA AGCGAAGCTC TCCAGCTTTG GCTAACCCAA 4200 TCCGAGCACATCAAGAGCTT CAGTCTTGTG ACAGTAAGAA ATTTAGGAAC 4250 ATAGTTGACC TATATTTTGTATTCTTTCTT GTTGAATGCA GTCCAAATAC 4300 AAAA 4304 998 amino acids aminoacid single linear peptide 2 Met Val Val Gln Thr Arg Phe Pro Ser Trp IleIle Leu Cys Tyr Ile 1 5 10 15 Trp Leu Leu Gly Phe Ala His Thr Gly GluAla Gln Ala Ala Lys Glu 20 25 30 Val Leu Leu Leu Asp Ser Lys Ala Gln GlnThr Glu Leu Glu Trp Ile 35 40 45 Ser Ser Pro Pro Ser Gly Trp Glu Glu IleSer Gly Leu Asp Glu Asn 50 55 60 Tyr Thr Pro Ile Arg Thr Tyr Gln Val CysGln Val Met Glu Pro Asn 65 70 75 80 Gln Asn Asn Trp Leu Arg Thr Asn TrpIle Ser Lys Gly Asn Ala Gln 85 90 95 Arg Ile Phe Val Glu Leu Lys Phe ThrLeu Arg Asp Cys Asn Ser Leu 100 105 110 Pro Gly Val Leu Gly Thr Cys LysGlu Thr Phe Asn Leu Tyr Tyr Tyr 115 120 125 Glu Thr Asp Tyr Asp Thr GlyArg Asn Ile Arg Glu Asn Leu Tyr Val 130 135 140 Lys Ile Asp Thr Ile AlaAla Asp Glu Ser Phe Thr Gln Gly Asp Leu 145 150 155 160 Gly Glu Arg LysMet Lys Leu Asn Thr Glu Val Arg Glu Ile Gly Pro 165 170 175 Leu Ser LysLys Gly Phe Tyr Leu Ala Phe Gln Asp Val Gly Ala Cys 180 185 190 Ile AlaLeu Val Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr Ile 195 200 205 ValGlu Asn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 210 215 220Ser Ser Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 225 230235 240 Glu Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp245 250 255 Leu Val Pro Ile Gly Lys Cys Ile Cys Lys Ala Gly Tyr Gln GlnLys 260 265 270 Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys SerSer Ser 275 280 285 Gln Asp Leu Gln Cys Ser Arg Cys Pro Thr His Ser PheSer Asp Arg 290 295 300 Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly TyrTyr Arg Ala Pro 305 310 315 320 Ser Asp Pro Pro Tyr Val Ala Cys Thr ArgPro Pro Ser Ala Pro Gln 325 330 335 Asn Leu Ile Phe Asn Ile Asn Gln ThrThr Val Ser Leu Glu Trp Ser 340 345 350 Pro Pro Ala Asp Asn Gly Gly ArgAsn Asp Val Thr Tyr Arg Ile Leu 355 360 365 Cys Lys Arg Cys Ser Trp GluGln Gly Glu Cys Val Pro Cys Gly Ser 370 375 380 Asn Ile Gly Tyr Met ProGln Gln Thr Gly Leu Glu Asp Asn Tyr Val 385 390 395 400 Thr Val Met AspLeu Leu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 405 410 415 Ala Val AsnGly Val Ser Asp Leu Ser Arg Ser Gln Arg Leu Phe Ala 420 425 430 Ala ValSer Ile Thr Thr Gly Gln Ala Ala Pro Ser Gln Val Ser Gly 435 440 445 ValMet Lys Glu Arg Val Leu Gln Arg Ser Val Gln Leu Ser Trp Gln 450 455 460Glu Pro Glu His Pro Asn Gly Val Ile Thr Glu Tyr Glu Ile Lys Tyr 465 470475 480 Tyr Glu Lys Asp Gln Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys485 490 495 Ser Thr Ser Ala Ser Ile Asn Asn Leu Lys Pro Gly Thr Val TyrVal 500 505 510 Phe Gln Ile Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn TyrSer Pro 515 520 525 Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly LysMet Phe Glu 530 535 540 Ala Thr Ala Val Ser Ser Glu Gln Asn Pro Val IleIle Ile Ala Val 545 550 555 560 Val Ala Val Ala Gly Thr Ile Ile Leu ValPhe Met Val Phe Gly Phe 565 570 575 Ile Ile Gly Arg Arg His Cys Gly TyrSer Lys Ala Asp Gln Glu Gly 580 585 590 Asp Glu Glu Leu Tyr Phe His PheLys Phe Pro Gly Thr Lys Thr Tyr 595 600 605 Ile Asp Pro Glu Thr Tyr GluAsp Pro Asn Arg Ala Val His Gln Phe 610 615 620 Ala Lys Glu Leu Asp AlaSer Cys Ile Lys Ile Glu Arg Val Ile Gly 625 630 635 640 Ala Gly Glu PheGly Glu Val Cys Ser Gly Arg Leu Lys Leu Pro Gly 645 650 655 Gln Arg AspVal Ala Val Ala Ile Lys Thr Leu Lys Val Gly Tyr Thr 660 665 670 Glu LysGln Arg Arg Asp Phe Leu Cys Glu Ala Ser Ile Met Gly Gln 675 680 685 PheAsp His Pro Asn Val Val His Leu Glu Gly Val Val Thr Arg Gly 690 695 700Lys Pro Val Met Ile Val Ile Glu Phe Met Glu Asn Gly Ala Leu Asp 705 710715 720 Ala Phe Leu Arg Lys His Asp Gly Gln Phe Thr Val Ile Gln Leu Val725 730 735 Gly Met Leu Arg Gly Ile Ala Ala Gly Met Arg Tyr Leu Ala AspMet 740 745 750 Gly Tyr Val His Arg Asp Leu Ala Ala Arg Asn Ile Leu ValAsn Ser 755 760 765 Asn Leu Val Cys Lys Val Ser Asp Phe Gly Leu Ser ArgVal Ile Glu 770 775 780 Asp Asp Pro Glu Ala Val Tyr Thr Thr Thr Gly GlyLys Ile Pro Val 785 790 795 800 Arg Trp Thr Ala Pro Glu Ala Ile Gln TyrArg Lys Phe Thr Ser Ala 805 810 815 Ser Asp Val Trp Ser Tyr Gly Ile ValMet Trp Glu Val Met Ser Tyr 820 825 830 Gly Glu Arg Pro Tyr Trp Asp MetSer Asn Gln Asp Val Ile Lys Ala 835 840 845 Ile Glu Glu Gly Tyr Arg LeuPro Ala Pro Met Asp Cys Pro Ala Gly 850 855 860 Leu His Gln Leu Met LeuAsp Cys Trp Gln Lys Asp Arg Ala Glu Arg 865 870 875 880 Pro Lys Phe GluGln Ile Val Gly Ile Leu Asp Lys Met Ile Arg Asn 885 890 895 Pro Ser SerLeu Lys Thr Pro Leu Gly Thr Cys Ser Arg Pro Leu Ser 900 905 910 Pro LeuLeu Asp Gln Ser Thr Pro Asp Phe Thr Ala Phe Cys Ser Val 915 920 925 GlyGlu Trp Leu Gln Ala Ile Lys Met Glu Arg Tyr Lys Asp Asn Phe 930 935 940Thr Ala Ala Gly Tyr Asn Ser Leu Glu Ser Val Ala Arg Met Thr Ile 945 950955 960 Asp Asp Val Met Ser Leu Gly Ile Thr Leu Val Gly His Gln Lys Lys965 970 975 Ile Met Ser Ser Ile Gln Thr Met Arg Ala Gln Met Leu His LeuHis 980 985 990 Gly Thr Gly Ile Gln Val 995 610 amino acids amino acidsingle linear peptide 3 Met Val Val Gln Thr Arg Phe Pro Ser Trp Ile IleLeu Cys Tyr Ile 1 5 10 15 Trp Leu Leu Gly Phe Ala His Thr Gly Glu AlaGln Ala Ala Lys Glu 20 25 30 Val Leu Leu Leu Asp Ser Lys Ala Gln Gln ThrGlu Leu Glu Trp Ile 35 40 45 Ser Ser Pro Pro Ser Gly Trp Glu Glu Ile SerGly Leu Asp Glu Asn 50 55 60 Tyr Thr Pro Ile Arg Thr Tyr Gln Val Cys GlnVal Met Glu Pro Asn 65 70 75 80 Gln Asn Asn Trp Leu Arg Thr Asn Trp IleSer Lys Gly Asn Ala Gln 85 90 95 Arg Ile Phe Val Glu Leu Lys Phe Thr LeuArg Asp Cys Asn Ser Leu 100 105 110 Pro Gly Val Leu Gly Thr Cys Lys GluThr Phe Asn Leu Tyr Tyr Tyr 115 120 125 Glu Thr Asp Tyr Asp Thr Gly ArgAsn Ile Arg Glu Asn Leu Tyr Val 130 135 140 Lys Ile Asp Thr Ile Ala AlaAsp Glu Ser Phe Thr Gln Gly Asp Leu 145 150 155 160 Gly Glu Arg Lys MetLys Leu Asn Thr Glu Val Arg Glu Ile Gly Pro 165 170 175 Leu Ser Lys LysGly Phe Tyr Leu Ala Phe Gln Asp Val Gly Ala Cys 180 185 190 Ile Ala LeuVal Ser Val Lys Val Tyr Tyr Lys Lys Cys Trp Thr Ile 195 200 205 Val GluAsn Leu Ala Val Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 210 215 220 SerSer Leu Val Glu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 225 230 235240 Glu Ala Glu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp 245250 255 Leu Val Pro Ile Gly Lys Cys Ile Cys Lys Ala Gly Tyr Gln Gln Lys260 265 270 Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser SerSer 275 280 285 Gln Asp Leu Gln Cys Ser Arg Cys Pro Thr His Ser Phe SerAsp Arg 290 295 300 Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr TyrArg Ala Pro 305 310 315 320 Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg ProPro Ser Ala Pro Gln 325 330 335 Asn Leu Ile Phe Asn Ile Asn Gln Thr ThrVal Ser Leu Glu Trp Ser 340 345 350 Pro Pro Ala Asp Asn Gly Gly Arg AsnAsp Val Thr Tyr Arg Ile Leu 355 360 365 Cys Lys Arg Cys Ser Trp Glu GlnGly Glu Cys Val Pro Cys Gly Ser 370 375 380 Asn Ile Gly Tyr Met Pro GlnGln Thr Gly Leu Glu Asp Asn Tyr Val 385 390 395 400 Thr Val Met Asp LeuLeu Ala His Ala Asn Tyr Thr Phe Glu Val Glu 405 410 415 Ala Val Asn GlyVal Ser Asp Leu Ser Arg Ser Gln Arg Leu Phe Ala 420 425 430 Ala Val SerIle Thr Thr Gly Gln Ala Ala Pro Ser Gln Val Ser Gly 435 440 445 Val MetLys Glu Arg Val Leu Gln Arg Ser Val Gln Leu Ser Trp Gln 450 455 460 GluPro Glu His Pro Asn Gly Val Ile Thr Glu Tyr Glu Ile Lys Tyr 465 470 475480 Tyr Glu Lys Asp Gln Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys 485490 495 Ser Thr Ser Ala Ser Ile Asn Asn Leu Lys Pro Gly Thr Val Tyr Val500 505 510 Phe Gln Ile Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr SerPro 515 520 525 Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys MetPhe Glu 530 535 540 Ala Thr Ala Val Ser Ser Glu Gln Asn Pro Val Ile IleIle Ala Val 545 550 555 560 Val Ala Val Ala Gly Thr Ile Ile Leu Val PheMet Val Phe Gly Phe 565 570 575 Ile Ile Gly Arg Arg His Cys Gly Tyr SerLys Ala Asp Gln Glu Gly 580 585 590 Asp Glu Glu Leu Tyr Phe His Ser LeuVal Thr Asn Glu His Leu Ser 595 600 605 Val Leu 610 2901 base pairsnucleic acid single linear nucleic 4 AAGCGGCCGG TCTGCAGTCG GAGACTTGCAGGCAGCAAAC ACGGTGCGAA 50 CGAACCGGAG GGGGGAGAGA GAAATCAAAC AGCTAAGCGTGGAGCAGACG 100 GCCTGGGACC CAGAAGGGGA TCGATGCGAG GAGCGCAATA ATAACAACAA150 TAATAACCCA CTTCGGAGCA AACAGCATCT AAAGAGCTGC GACCCAACTG 200CAGCCTAAAA AAATCAAACC TGCTCATGCA CCATGGTTGT TCAAACTCGG 250 TTCCCTTCGTGGATTATTTT GTGTTACATC TGGCTGCTTG GCTTTGCACA 300 CACGGGGGAG GCGCAGGCTGCGAAGGAAGT ACTATTACTG GACTCGAAAG 350 CACAACAAAC AGAATTGGAA TGGATTTCCTCTCCACCCAG TGGGTGGGAA 400 GAAATTAGTG GTTTGGATGA GAACTACACT CCGATAAGAACATACCAGGT 450 GTGCCAGGTC ATGGAGCCCA ACCAGAACAA CTGGCTGCGG ACTAACTGGA500 TTTCTAAAGG CAACGCACAA AGGATTTTTG TAGAATTGAA ATTCACCTTG 550AGGGATTGTA ATAGTCTTCC CGGAGTCCTG GGAACTTGCA AGGAAACGTT 600 TAATTTGTACTATTATGAAA CAGACTACGA CACCGGCAGG AATATACGAG 650 AAAACCTTTA TGTTAAAATAGACACCATTG CTGCAGATGA AAGTTTCACA 700 CAAGGTGACC TTGGTGAAAG AAAGATGAAGCTGAACACTG AGGTGAGAGA 750 GATTGGACCT TTGTCCAAAA AGGGATTCTA TCTTGCCTTTCAGGATGTAG 800 GGGCTTGCAT AGCATTGGTT TCTGTCAAAG TGTACTACAA GAAGTGCTGG850 ACCATTGTTG AGAACTTAGC TGTCTTTCCA GATACAGTGA CTGGTTCGGA 900ATTTTCCTCC TTAGTCGAGG TCCGTGGGAC ATGTGTCAGC AGTGCCGAGG 950 AAGAGGCAGAAAATTCCCCC AGAATGCATT GCAGTGCAGA AGGAGAGTGG 1000 CTAGTACCCA TTGGAAAATGCATCTGCAAA GCAGGCTATC AGCAAAAAGG 1050 GGACACTTGC GAACCCTGTG GCCGCAGGTTCTACAAATCT TCCTCTCAGG 1100 ATCTCCAGTG TTCTCGTTGT CCAACCCACA GCTTCTCTGACCGAGAAGGA 1150 TCATCCAGGT GTGAATGTGA AGATGGGTAC TACAGAGCTC CTTCTGATCC1200 ACCATACGTT GCATGCACGA GGCCTCCCTC TGCACCACAG AACCTTATTT 1250TCAATATCAA TCAAACGACT GTAAGTTTGG AATGGAGTCC TCCGGCTGAC 1300 AACGGGGGAAGAAACGATGT CACCTACAGA ATACTGTGTA AGCGGTGCAG 1350 TTGGGAACAG GGAGAATGTGTGCCATGCGG AAGTAACATT GGATACATGC 1400 CCCAGCAGAC GGGATTAGAG GATAACTATGTCACTGTCAT GGACCTACTT 1450 GCCCATGCAA ATTACACTTT CGAAGTTGAA GCTGTAAATGGAGTTTCGGA 1500 CTTAAGCAGA TCCCAGAGGC TCTTCGCTGC TGTTAGCATC ACCACCGGTC1550 AAGCAGCTCC CTCGCAAGTG AGTGGAGTCA TGAAGGAGCG AGTACTGCAG 1600CGGAGTGTGC AGCTTTCCTG GCAGGAGCCG GAGCATCCCA ATGGAGTCAT 1650 CACGGAATATGAAATCAAGT ATTATGAGAA AGATCAACGG GAAAGGACGT 1700 ACTCAACACT CAAAACCAAGTCCACCTCCG CCTCCATTAA TAATCTGAAA 1750 CCGGGAACAG TGTACGTCTT TCAGATCCGGGCGGTCACTG CTGCCGGTTA 1800 TGGAAACTAC AGCCCTAGGC TTGATGTTGC CACACTTGAGGAAGCTTCAG 1850 GTAAAATGTT TGAAGCGACA GCAGTCTCCA GTGAACAGAA TCCTGTCATC1900 ATAATTGCTG TAGTGGCTGT AGCAGGGACC ATCATCTTGG TGTTCATGGT 1950GTTCGGCTTC ATCATTGGAA GAAGGCACTG TGGTTATAGC AAGGCTGACC 2000 AAGAAGGGGATGAAGAACTC TACTTTCATT CTTTAGTAAC AAATGAGCAC 2050 CTGTCAGTTT TATAAACCGCAACAATAACT GTTTAAGACA ATCAATTTTG 2100 GATAAACAAT CAACTACAGC AGAATAAATCAAGATTTTTA AGTCCCATTT 2150 TCCTTTATAC ATTCTGCTTA TTTTGTTGTT ATATGTTTATTTTTTAAACT 2200 CTGATCTTGA TTGAATGTGA TACCATAAGC ACAGTTAGGC TGCAGTGTAA2250 ATATATAAAG ACATTGTTCT GAGAGCAGTA CGATTTCATG GAAAGATTGT 2300TTGGTGGCTT TGTTAAAATT AATAAAGAAT TTTTAAGGAT ATAGTGTAAT 2350 TTTCTTCATTGCATTAATAT AACCAAATAT GCCTACCTAT CTTTGTCTTG 2400 AACCAAATGA ATAGATTTGGAATACTTTAT TGTAATTGAA TTTGATATAA 2450 AGTTGACTGA GCATTTATGT GTTACCTGCATGCTTCTGGG TGCATTGAAA 2500 TATTTTAACT TTTAAAATGA TACTATGTTG TTTCAATTTTGACTACCTTT 2550 TGTGAGGCAT ACTGGCTACC TCCTCCTATT AGCTAAGATC TTCCAAAGCC2600 TTATAATGAA AAGTTTATAT AAACCATTTC TCTTTCAAAT CACTGTCATA 2650CTTGGTCACG GATCCCAGGA ATATTGTAAA TTTTCTAATT TACTCTGCAC 2700 TTTGTATATCCAGCCTCTAT TACCCTCAAG GTGAATATAA AACTATGTCT 2750 TTTGAATATT TCTCTTTGATTTTGTGATAG CAGTCCCTCA TATCTTGTAC 2800 TAATTTTATG TATATGTCAA CAGTGGTTGGTCTTTAAAAA TAAATCAAAG 2850 AATAAGTAAA AAAAAAAAAA AAAAAAAAAA AAAAATAAAAAAAAAAAAAA 2900 A 2901 626 amino acids amino acid single linear peptide5 Met Val Val Gln Thr Arg Phe Pro Ser Trp Ile Ile Leu Cys Tyr Ile 1 5 1015 Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gln Ala Ala Lys Glu 20 2530 Val Leu Leu Leu Asp Ser Lys Ala Gln Gln Thr Glu Leu Glu Trp Ile 35 4045 Ser Ser Pro Pro Ser Gly Trp Glu Glu Ile Ser Gly Leu Asp Glu Asn 50 5560 Tyr Thr Pro Ile Arg Thr Tyr Gln Val Cys Gln Val Met Glu Pro Asn 65 7075 80 Gln Asn Asn Trp Leu Arg Thr Asn Trp Ile Ser Lys Gly Asn Ala Gln 8590 95 Arg Ile Phe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu100 105 110 Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr TyrTyr 115 120 125 Glu Thr Asp Tyr Asp Thr Gly Arg Asn Ile Arg Glu Asn LeuTyr Val 130 135 140 Lys Ile Asp Thr Ile Ala Ala Asp Glu Ser Phe Thr GlnGly Asp Leu 145 150 155 160 Gly Glu Arg Lys Met Lys Leu Asn Thr Glu ValArg Glu Ile Gly Pro 165 170 175 Leu Ser Lys Lys Gly Phe Tyr Leu Ala PheGln Asp Val Gly Ala Cys 180 185 190 Ile Ala Leu Val Ser Val Lys Val TyrTyr Lys Lys Cys Trp Thr Ile 195 200 205 Val Glu Asn Leu Ala Val Phe ProAsp Thr Val Thr Gly Ser Glu Phe 210 215 220 Ser Ser Leu Val Glu Val ArgGly Thr Cys Val Ser Ser Ala Glu Glu 225 230 235 240 Glu Ala Glu Asn SerPro Arg Met His Cys Ser Ala Glu Gly Glu Trp 245 250 255 Leu Val Pro IleGly Lys Cys Ile Cys Lys Ala Gly Tyr Gln Gln Lys 260 265 270 Gly Asp ThrCys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser 275 280 285 Gln AspLeu Gln Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg 290 295 300 GluGly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 305 310 315320 Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gln 325330 335 Asn Leu Ile Phe Asn Ile Asn Gln Thr Thr Val Ser Leu Glu Trp Ser340 345 350 Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg IleLeu 355 360 365 Cys Lys Arg Cys Ser Trp Glu Gln Gly Glu Cys Val Pro CysGly Ser 370 375 380 Asn Ile Gly Tyr Met Pro Gln Gln Thr Gly Leu Glu AspAsn Tyr Val 385 390 395 400 Thr Val Met Asp Leu Leu Ala His Ala Asn TyrThr Phe Glu Val Glu 405 410 415 Ala Val Asn Gly Val Ser Asp Leu Ser ArgSer Gln Arg Leu Phe Ala 420 425 430 Ala Val Ser Ile Thr Thr Gly Gln AlaAla Pro Ser Gln Val Ser Gly 435 440 445 Val Met Lys Glu Arg Val Leu GlnArg Ser Val Gln Leu Ser Trp Gln 450 455 460 Glu Pro Glu His Pro Asn GlyVal Ile Thr Glu Tyr Glu Ile Lys Tyr 465 470 475 480 Tyr Glu Lys Asp GlnArg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys 485 490 495 Ser Thr Ser AlaSer Ile Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 500 505 510 Phe Gln IleArg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 515 520 525 Arg LeuAsp Val Ala Thr Leu Glu Glu Ala Ser Gly Lys Met Phe Glu 530 535 540 AlaThr Ala Val Ser Ser Glu Gln Asn Pro Val Ile Ile Ile Ala Val 545 550 555560 Val Ala Val Ala Gly Thr Ile Ile Leu Val Phe Met Val Phe Gly Phe 565570 575 Ile Ile Gly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gln Glu Gly580 585 590 Asp Glu Glu Leu Tyr Phe His Ser Leu Tyr Arg Glu Arg Gly AspGly 595 600 605 Met Glu Lys Thr Gln His Asn Lys Lys Trp Met Ile Ala SerCys Ser 610 615 620 Arg Leu 625 2323 base pairs nucleic acid singlelinear nucleic 6 AAGCGGCCGG TCTGCAGTCG GAGACTTGCA GGCAGCAAAC ACGGTGCGAA50 CGAACCGGAG GGGGGAGAGA GAAATCAAAC AGCTAAGCGT GGAGCAGACG 100 GCCTGGGACCCAGAAGGGGA TCGATGCGAG GAGCGCAATA ATAACAACAA 150 TAATAACCCA CTTCGGAGCAAACAGCATCT AAAGAGCTGC GACCCAACTG 200 CAGCCTAAAA AAATCAAACC TGCTCATGCACCATGGTTGT TCAAACTCGG 250 TTCCCTTCGT GGATTATTTT GTGTTACATC TGGCTGCTTGGCTTTGCACA 300 CACGGGGGAG GCGCAGGCTG CGAAGGAAGT ACTATTACTG GACTCGAAAG350 CACAACAAAC AGAATTGGAA TGGATTTCCT CTCCACCCAG TGGGTGGGAA 400GAAATTAGTG GTTTGGATGA GAACTACACT CCGATAAGAA CATACCAGGT 450 GTGCCAGGTCATGGAGCCCA ACCAGAACAA CTGGCTGCGG ACTAACTGGA 500 TTTCTAAAGG CAACGCACAAAGGATTTTTG TAGAATTGAA ATTCACCTTG 550 AGGGATTGTA ATAGTCTTCC CGGAGTCCTGGGAACTTGCA AGGAAACGTT 600 TAATTTGTAC TATTATGAAA CAGACTACGA CACCGGCAGGAATATACGAG 650 AAAACCTTTA TGTTAAAATA GACACCATTG CTGCAGATGA AAGTTTCACA700 CAAGGTGACC TTGGTGAAAG AAAGATGAAG CTGAACACTG AGGTGAGAGA 750GATTGGACCT TTGTCCAAAA AGGGATTCTA TCTTGCCTTT CAGGATGTAG 800 GGGCTTGCATAGCATTGGTT TCTGTCAAAG TGTACTACAA GAAGTGCTGG 850 ACCATTGTTG AGAACTTAGCTGTCTTTCCA GATACAGTGA CTGGTTCGGA 900 ATTTTCCTCC TTAGTCGAGG TCCGTGGGACATGTGTCAGC AGTGCCGAGG 950 AAGAGGCAGA AAATTCCCCC AGAATGCATT GCAGTGCAGAAGGAGAGTGG 1000 CTAGTACCCA TTGGAAAATG CATCTGCAAA GCAGGCTATC AGCAAAAAGG1050 GGACACTTGC GAACCCTGTG GCCGCAGGTT CTACAAATCT TCCTCTCAGG 1100ATCTCCAGTG TTCTCGTTGT CCAACCCACA GCTTCTCTGA CCGAGAAGGA 1150 TCATCCAGGTGTGAATGTGA AGATGGGTAC TACAGAGCTC CTTCTGATCC 1200 ACCATACGTT GCATGCACGAGGCCTCCCTC TGCACCACAG AACCTTATTT 1250 TCAATATCAA TCAAACGACT GTAAGTTTGGAATGGAGTCC TCCGGCTGAC 1300 AACGGGGGAA GAAACGATGT CACCTACAGA ATACTGTGTAAGCGGTGCAG 1350 TTGGGAACAG GGAGAATGTG TGCCATGCGG AAGTAACATT GGATACATGC1400 CCCAGCAGAC GGGATTAGAG GATAACTATG TCACTGTCAT GGACCTACTT 1450GCCCATGCAA ATTACACTTT CGAAGTTGAA GCTGTAAATG GAGTTTCGGA 1500 CTTAAGCAGATCCCAGAGGC TCTTCGCTGC TGTTAGCATC ACCACCGGTC 1550 AAGCAGCTCC CTCGCAAGTGAGTGGAGTCA TGAAGGAGCG AGTACTGCAG 1600 CGGAGTGTGC AGCTTTCCTG GCAGGAGCCGGAGCATCCCA ATGGAGTCAT 1650 CACGGAATAT GAAATCAAGT ATTATGAGAA AGATCAACGGGAAAGGACGT 1700 ACTCAACACT CAAAACCAAG TCCACCTCCG CCTCCATTAA TAATCTGAAA1750 CCGGGAACAG TGTACGTCTT TCAGATCCGG GCGGTCACTG CTGCCGGTTA 1800TGGAAACTAC AGCCCTAGGC TTGATGTTGC CACACTTGAG GAAGCTTCAG 1850 GTAAAATGTTTGAAGCGACA GCAGTCTCCA GTGAACAGAA TCCTGTCATC 1900 ATAATTGCTG TAGTGGCTGTAGCAGGGACC ATCATCTTGG TGTTCATGGT 1950 GTTCGGCTTC ATCATTGGAA GAAGGCACTGTGGTTATAGC AAGGCTGACC 2000 AAGAAGGGGA TGAAGAACTC TACTTTCATT CTCTTTACAGGGAAAGGGGA 2050 GACGGGATGG AAAAGACACA GCACAATAAG AAGTGGATGA TTGCATCGTG2100 CTCTCGTTTG TAGGTCTCTT TTCCTAATCA ACACTATGAT TTTGAAGTAC 2150GCGTACACGA AGCAAACGGG AAGAGATAAG GAATTAGCAT TGTGAACCTG 2200 ACTGTAATCCTCTCTTCCGG AAAGAGATGA GATGCTATTG CGATGAGAAT 2250 GTACAACTTG CACCTTGAAATCTTTTTTGA TAATTAGTGC TCAGGGGAGG 2300 GGGGGGGAAG TAGAGAAAGC AAA 2323 6amino acids amino acid single linear peptide 7 Ala Ala Thr Ala Ala Ala 56 amino acids amino acid single linear peptide 8 Ala Ala Thr Ala Ala Ala5 6 amino acids amino acid single linear peptide 9 His Arg Asp Leu AlaAla 5 6 amino acids amino acid single linear peptide Xaa in position 2is valine or methionine; Xaa in position 5 is phenylalanine or tyrosine.10 Asp Xaa Trp Ser Xaa Gly 5 993 amino acids amino acid single linearpeptide 11 Met Val Val Gln Thr Arg Phe Pro Ser Trp Ile Ile Leu Cys TyrIle 1 5 10 15 Trp Leu Leu Gly Phe Ala His Thr Gly Glu Ala Gln Ala AlaLys Glu 20 25 30 Val Leu Leu Leu Asp Ser Lys Ala Gln Gln Thr Glu Leu GluTrp Ile 35 40 45 Ser Ser Pro Pro Ser Gly Trp Glu Glu Ile Ser Gly Leu AspGlu Asn 50 55 60 Tyr Thr Pro Ile Arg Thr Tyr Gln Val Cys Gln Val Met GluPro Asn 65 70 75 80 Gln Asn Asn Trp Leu Arg Thr Asn Trp Ile Ser Lys GlyAsn Ala Gln 85 90 95 Arg Ile Phe Val Glu Leu Lys Phe Thr Leu Arg Asp CysAsn Ser Leu 100 105 110 Pro Gly Val Leu Gly Thr Cys Lys Glu Thr Phe AsnLeu Tyr Tyr Tyr 115 120 125 Glu Thr Asp Tyr Asp Thr Gly Arg Asn Ile ArgGlu Asn Leu Tyr Val 130 135 140 Lys Ile Asp Thr Ile Ala Ala Asp Glu SerPhe Thr Gln Gly Asp Leu 145 150 155 160 Gly Glu Arg Lys Met Lys Leu AsnThr Glu Val Arg Glu Ile Gly Pro 165 170 175 Leu Ser Lys Lys Gly Phe TyrLeu Ala Phe Gln Asp Val Gly Ala Cys 180 185 190 Ile Ala Leu Val Ser ValLys Val Tyr Tyr Lys Lys Cys Trp Thr Ile 195 200 205 Val Glu Asn Leu AlaVal Phe Pro Asp Thr Val Thr Gly Ser Glu Phe 210 215 220 Ser Ser Leu ValGlu Val Arg Gly Thr Cys Val Ser Ser Ala Glu Glu 225 230 235 240 Glu AlaGlu Asn Ser Pro Arg Met His Cys Ser Ala Glu Gly Glu Trp 245 250 255 LeuVal Pro Ile Gly Lys Cys Ile Cys Lys Ala Gly Tyr Gln Gln Lys 260 265 270Gly Asp Thr Cys Glu Pro Cys Gly Arg Arg Phe Tyr Lys Ser Ser Ser 275 280285 Gln Asp Leu Gln Cys Ser Arg Cys Pro Thr His Ser Phe Ser Asp Arg 290295 300 Glu Gly Ser Ser Arg Cys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro305 310 315 320 Ser Asp Pro Pro Tyr Val Ala Cys Thr Arg Pro Pro Ser AlaPro Gln 325 330 335 Asn Leu Ile Phe Asn Ile Asn Gln Thr Thr Val Ser LeuGlu Trp Ser 340 345 350 Pro Pro Ala Asp Asn Gly Gly Arg Asn Asp Val ThrTyr Arg Ile Leu 355 360 365 Cys Lys Arg Cys Ser Trp Glu Gln Gly Glu CysVal Pro Cys Gly Ser 370 375 380 Asn Ile Gly Tyr Met Pro Gln Gln Thr GlyLeu Glu Asp Asn Tyr Val 385 390 395 400 Thr Val Met Asp Leu Leu Ala HisAla Asn Tyr Thr Phe Glu Val Glu 405 410 415 Ala Val Asn Gly Val Ser AspLeu Ser Arg Ser Gln Arg Leu Phe Ala 420 425 430 Ala Val Ser Ile Thr ThrGly Gln Ala Ala Pro Ser Gln Val Ser Gly 435 440 445 Val Met Lys Glu ArgVal Leu Gln Arg Ser Val Gln Leu Ser Trp Gln 450 455 460 Glu Pro Glu HisPro Asn Gly Val Ile Thr Glu Tyr Glu Ile Lys Tyr 465 470 475 480 Tyr GluLys Asp Gln Arg Glu Arg Thr Tyr Ser Thr Leu Lys Thr Lys 485 490 495 SerThr Ser Ala Ser Ile Asn Asn Leu Lys Pro Gly Thr Val Tyr Val 500 505 510Phe Gln Ile Arg Ala Val Thr Ala Ala Gly Tyr Gly Asn Tyr Ser Pro 515 520525 Arg Leu Asp Val Ala Thr Leu Glu Glu Ala Ser Ala Thr Ala Val Ser 530535 540 Ser Glu Gln Asn Pro Val Ile Ile Ile Ala Val Val Ala Val Ala Gly545 550 555 560 Thr Ile Ile Leu Val Phe Met Val Phe Gly Phe Ile Ile GlyArg Arg 565 570 575 His Cys Gly Tyr Ser Lys Ala Asp Gln Glu Gly Asp GluGlu Leu Tyr 580 585 590 Phe His Phe Lys Phe Pro Gly Thr Lys Thr Tyr IleAsp Pro Glu Thr 595 600 605 Tyr Glu Asp Pro Asn Arg Ala Val His Gln PheAla Lys Glu Leu Asp 610 615 620 Ala Ser Cys Ile Lys Ile Glu Arg Val IleGly Ala Gly Glu Phe Gly 625 630 635 640 Glu Val Cys Ser Gly Arg Leu LysLeu Pro Gly Gln Arg Asp Val Ala 645 650 655 Val Ala Ile Lys Thr Leu LysVal Gly Tyr Thr Glu Lys Gln Arg Arg 660 665 670 Asp Phe Leu Cys Glu AlaSer Ile Met Gly Gln Phe Asp His Pro Asn 675 680 685 Val Val His Leu GluGly Val Val Thr Arg Gly Lys Pro Val Met Ile 690 695 700 Val Ile Glu PheMet Glu Asn Gly Ala Leu Asp Ala Phe Leu Arg Lys 705 710 715 720 His AspGly Gln Phe Thr Val Ile Gln Leu Val Gly Met Leu Arg Gly 725 730 735 IleAla Ala Gly Met Arg Tyr Leu Ala Asp Met Gly Tyr Val His Arg 740 745 750Asp Leu Ala Ala Arg Asn Ile Leu Val Asn Ser Asn Leu Val Cys Lys 755 760765 Val Ser Asp Phe Gly Leu Ser Arg Val Ile Glu Asp Asp Pro Glu Ala 770775 780 Val Tyr Thr Thr Thr Gly Gly Lys Ile Pro Val Arg Trp Thr Ala Pro785 790 795 800 Glu Ala Ile Gln Tyr Arg Lys Phe Thr Ser Ala Ser Asp ValTrp Ser 805 810 815 Tyr Gly Ile Val Met Trp Glu Val Met Ser Tyr Gly GluArg Pro Tyr 820 825 830 Trp Asp Met Ser Asn Gln Asp Val Ile Lys Ala IleGlu Glu Gly Tyr 835 840 845 Arg Leu Pro Ala Pro Met Asp Cys Pro Ala GlyLeu His Gln Leu Met 850 855 860 Leu Asp Cys Trp Gln Lys Asp Arg Ala GluArg Pro Lys Phe Glu Gln 865 870 875 880 Ile Val Gly Ile Leu Asp Lys MetIle Arg Asn Pro Ser Ser Leu Lys 885 890 895 Thr Pro Leu Gly Thr Cys SerArg Pro Leu Ser Pro Leu Leu Asp Gln 900 905 910 Ser Thr Pro Asp Phe ThrAla Phe Cys Ser Val Gly Glu Trp Leu Gln 915 920 925 Ala Ile Lys Met GluArg Tyr Lys Asp Asn Phe Thr Ala Ala Gly Tyr 930 935 940 Asn Ser Leu GluSer Val Ala Arg Met Thr Ile Asp Asp Val Met Ser 945 950 955 960 Leu GlyIle Thr Leu Val Gly His Gln Lys Lys Ile Met Ser Ser Ile 965 970 975 GlnThr Met Arg Ala Gln Met Leu His Leu His Gly Thr Gly Ile Gln 980 985 990Val 994 amino acids amino acid single linear peptide 12 Met Val Val GlnThr Arg Phe Pro Ser Trp Ile Ile Leu Cys Tyr Ile 1 5 10 15 Trp Leu LeuGly Phe Ala His Thr Gly Glu Ala Gln Ala Ala Lys Glu 20 25 30 Val Leu LeuLeu Asp Ser Lys Ala Gln Gln Thr Glu Leu Glu Trp Ile 35 40 45 Ser Ser ProPro Ser Gly Trp Glu Glu Ile Ser Gly Leu Asp Glu Asn 50 55 60 Tyr Thr ProIle Arg Thr Tyr Gln Val Cys Gln Val Met Glu Pro Asn 65 70 75 80 Gln AsnAsn Trp Leu Arg Thr Asn Trp Ile Ser Lys Gly Asn Ala Gln 85 90 95 Arg IlePhe Val Glu Leu Lys Phe Thr Leu Arg Asp Cys Asn Ser Leu 100 105 110 ProGly Val Leu Gly Thr Cys Lys Glu Thr Phe Asn Leu Tyr Tyr Tyr 115 120 125Glu Thr Asp Tyr Asp Thr Gly Arg Asn Ile Arg Glu Asn Leu Tyr Val 130 135140 Lys Ile Asp Thr Ile Ala Ala Asp Glu Ser Phe Thr Gln Gly Asp Leu 145150 155 160 Gly Glu Arg Lys Met Lys Leu Asn Thr Glu Val Arg Glu Ile GlyPro 165 170 175 Leu Ser Lys Lys Gly Phe Tyr Leu Ala Phe Gln Asp Val GlyAla Cys 180 185 190 Ile Ala Leu Val Ser Val Lys Val Tyr Tyr Lys Lys CysTrp Thr Ile 195 200 205 Val Glu Asn Leu Ala Val Phe Pro Asp Thr Val ThrGly Ser Glu Phe 210 215 220 Ser Ser Leu Val Glu Val Arg Gly Thr Cys ValSer Ser Ala Glu Glu 225 230 235 240 Glu Ala Glu Asn Ser Pro Arg Met HisCys Ser Ala Glu Gly Glu Trp 245 250 255 Leu Val Pro Ile Gly Lys Cys IleCys Lys Ala Gly Tyr Gln Gln Lys 260 265 270 Gly Asp Thr Cys Glu Pro CysGly Arg Arg Phe Tyr Lys Ser Ser Ser 275 280 285 Gln Asp Leu Gln Cys SerArg Cys Pro Thr His Ser Phe Ser Asp Arg 290 295 300 Glu Gly Ser Ser ArgCys Glu Cys Glu Asp Gly Tyr Tyr Arg Ala Pro 305 310 315 320 Ser Asp ProPro Tyr Val Ala Cys Thr Arg Pro Pro Ser Ala Pro Gln 325 330 335 Asn LeuIle Phe Asn Ile Asn Gln Thr Thr Val Ser Leu Glu Trp Ser 340 345 350 ProPro Ala Asp Asn Gly Gly Arg Asn Asp Val Thr Tyr Arg Ile Leu 355 360 365Cys Lys Arg Cys Ser Trp Glu Gln Gly Glu Cys Val Pro Cys Gly Ser 370 375380 Asn Ile Gly Tyr Met Pro Gln Gln Thr Gly Leu Glu Asp Asn Tyr Val 385390 395 400 Thr Val Met Asp Leu Leu Ala His Ala Asn Tyr Thr Phe Glu ValGlu 405 410 415 Ala Val Asn Gly Val Ser Asp Leu Ser Arg Ser Gln Arg LeuPhe Ala 420 425 430 Ala Val Ser Ile Thr Thr Gly Gln Ala Ala Pro Ser GlnVal Ser Gly 435 440 445 Val Met Lys Glu Arg Val Leu Gln Arg Ser Val GlnLeu Ser Trp Gln 450 455 460 Glu Pro Glu His Pro Asn Gly Val Ile Thr GluTyr Glu Ile Lys Tyr 465 470 475 480 Tyr Glu Lys Asp Gln Arg Glu Arg ThrTyr Ser Thr Leu Lys Thr Lys 485 490 495 Ser Thr Ser Ala Ser Ile Asn AsnLeu Lys Pro Gly Thr Val Tyr Val 500 505 510 Phe Gln Ile Arg Ala Val ThrAla Ala Gly Tyr Gly Asn Tyr Ser Pro 515 520 525 Arg Leu Asp Val Ala ThrLeu Glu Glu Ala Ser Gly Lys Met Phe Glu 530 535 540 Ala Thr Ala Val SerSer Glu Gln Asn Pro Val Ile Ile Ile Ala Val 545 550 555 560 Val Ala ValAla Gly Thr Ile Ile Leu Val Phe Met Val Phe Gly Phe 565 570 575 Ile IleGly Arg Arg His Cys Gly Tyr Ser Lys Ala Asp Gln Glu Gly 580 585 590 AspGlu Glu Leu Tyr Phe His Cys Thr Lys Thr Tyr Ile Asp Pro Glu 595 600 605Thr Tyr Glu Asp Pro Asn Arg Ala Val His Gln Phe Ala Lys Glu Leu 610 615620 Asp Ala Ser Cys Ile Lys Ile Glu Arg Val Ile Gly Ala Gly Glu Phe 625630 635 640 Gly Glu Val Cys Ser Gly Arg Leu Lys Leu Pro Gly Gln Arg AspVal 645 650 655 Ala Val Ala Ile Lys Thr Leu Lys Val Gly Tyr Thr Glu LysGln Arg 660 665 670 Arg Asp Phe Leu Cys Glu Ala Ser Ile Met Gly Gln PheAsp His Pro 675 680 685 Asn Val Val His Leu Glu Gly Val Val Thr Arg GlyLys Pro Val Met 690 695 700 Ile Val Ile Glu Phe Met Glu Asn Gly Ala LeuAsp Ala Phe Leu Arg 705 710 715 720 Lys His Asp Gly Gln Phe Thr Val IleGln Leu Val Gly Met Leu Arg 725 730 735 Gly Ile Ala Ala Gly Met Arg TyrLeu Ala Asp Met Gly Tyr Val His 740 745 750 Arg Asp Leu Ala Ala Arg AsnIle Leu Val Asn Ser Asn Leu Val Cys 755 760 765 Lys Val Ser Asp Phe GlyLeu Ser Arg Val Ile Glu Asp Asp Pro Glu 770 775 780 Ala Val Tyr Thr ThrThr Gly Gly Lys Ile Pro Val Arg Trp Thr Ala 785 790 795 800 Pro Glu AlaIle Gln Tyr Arg Lys Phe Thr Ser Ala Ser Asp Val Trp 805 810 815 Ser TyrGly Ile Val Met Trp Glu Val Met Ser Tyr Gly Glu Arg Pro 820 825 830 TyrTrp Asp Met Ser Asn Gln Asp Val Ile Lys Ala Ile Glu Glu Gly 835 840 845Tyr Arg Leu Pro Ala Pro Met Asp Cys Pro Ala Gly Leu His Gln Leu 850 855860 Met Leu Asp Cys Trp Gln Lys Asp Arg Ala Glu Arg Pro Lys Phe Glu 865870 875 880 Gln Ile Val Gly Ile Leu Asp Lys Met Ile Arg Asn Pro Ser SerLeu 885 890 895 Lys Thr Pro Leu Gly Thr Cys Ser Arg Pro Leu Ser Pro LeuLeu Asp 900 905 910 Gln Ser Thr Pro Asp Phe Thr Ala Phe Cys Ser Val GlyGlu Trp Leu 915 920 925 Gln Ala Ile Lys Met Glu Arg Tyr Lys Asp Asn PheThr Ala Ala Gly 930 935 940 Tyr Asn Ser Leu Glu Ser Val Ala Arg Met ThrIle Asp Asp Val Met 945 950 955 960 Ser Leu Gly Ile Thr Leu Val Gly HisGln Lys Lys Ile Met Ser Ser 965 970 975 Ile Gln Thr Met Arg Ala Gln MetLeu His Leu His Gly Thr Gly Ile 980 985 990 Gln Val

What is claimed is:
 1. An isolated, enriched, or purified MDK1polypeptide comprising amino acids of the full length amino acid sequeceset forth in MDK1.Δ1 (SEQ ID NO: 11) or MDK1.Δ2 (SEQ ID NO:12).
 2. Anisolated, enriched or purified MDK1 polypeptide, comprising the aminoacids of the full length MDK1 amino acid sequence set forth SEQ. IDNO:2.
 3. An isolated, enriched, or purified polypeptide comprising theMDK1 extracellular domain of amino acids 18 to 538 set forth in SEQ IDNO:2 and comprising the MDK1 intracellular domain of amino acids 580-998set forth in SEQ ID NO:2.
 4. An isolated, enriched, or purifiedpolypeptide comprising the MDK1 intracellular domain as set forth byamino acids 580 to 998 of SEQ ID NO:2 and at least one MDK1 domainselected from the group consisting of the MDK1 extracellular domain ofamino acids 18 to 538 set forth in SEQ ID NO:2, the MDK1 transmembranedomain of amino acids 555 to 579 set forth in SEQ ID NO:2.
 5. Anisolated, enriched, or purified polpeptide comprising the MDK1intracellular domain of amino acids 580 to 998 set forth in SEQ ID No:2.